Friday, May 16, 2025

Oracle Documentation and Advantages of Reading Other Resources

After I published the first posts of the series (#1, #2), I received a private request to talk a bit more about the 11.2-to-19c upgrade. Considering that, frankly speaking, this is the most common upgrade case nowadays (one long-term release to the other!), I can definitely understand the importance of real-world experience for anybody who plans to do the same.

So, here is the story - or to be precise two stories.
  1. Production/Stage/External Test - these were handled by our partner DBAs, but the idea was pretty straightforward: copy a cold backup to the new environment and run an in-place upgrade (non-CDB). Same OS (Linux-to-Linux), character sets preserved, etc. I.e. everything is well-documented and clear, so nothing much to write about.
  2. Development and Internal Test - that one was a real head-scratcher that I'd like to share with everybody. The task was as follows:
    • Starting points: Windows 2008 Server with an 11.2.0.4 database and Win1252 charset, Windows 2012 Server with a 19c database with Win1252 charset (non-CDB). Each DB is about 200 GB
    • Required destination: an Amazon AWS Linux instance with a 19c database hosting both environments as PDBs, which would also allow creating AL32UTF8 PDBs if needed.
As you can see, a lot of transformation and a less-than-common transfer path. One of the reasons: you just can't install a 19c database to run an in-place upgrade using AutoUpgrade and other nice things on a Windows 2008 server. So, with the help of our system administrators I came up with a plan that eventually led to success. And that's where "other-than-documentation" resources really helped - some pieces of knowledge I took from Tim Hall's Oracle-Base, some from Mike Dietrich, some from other people just posting their ideas and experiences in their blogs and various online resources. Overall, without the community I would have experimented forever! And that's why I like so much being part of this knowledge-sharing environment - and try to contribute myself.

I started from the bottom - discussed with the sysadmins a good and reliable way of moving something to AWS. The answer I got was interesting: there was a simple process to port an existing virtual machine from our internal cloud to AWS. That made my life much simpler - it meant that we could build a local VM, play with it as much as needed (with a very easy way to create multiple snapshots, restore to any point, etc.) and eventually declare it ready to be transferred. Ok, the admins got an Oracle Linux ISO and created a new server for me in vSphere - now I had a destination.

The next step was to see whether my understanding of the internals was correct and it really was possible to move a PDB from Windows to Linux without any major hassle. To do that I first converted my 19c instance from a non-CDB to a PDB within a newly created CDB. That was just a couple of commands:

-- non-CDB instance
shutdown immediate
startup open read only
BEGIN
  DBMS_PDB.DESCRIBE(
    pdb_descr_file => 'f:\oradata\test01.xml');
END;
/
shutdown immediate

-- CDB instance
CREATE PLUGGABLE DATABASE test01 
using 'f:\oradata\test01.xml'
NOCOPY 
tempfile reuse 
/
ALTER SESSION SET CONTAINER=test01;
@?/rdbms/admin/noncdb_to_pdb.sql
ALTER PLUGGABLE DATABASE OPEN;
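
At this point a quick sanity check that the new PDB really is open never hurts (my own habit, not part of the original listing):

-- the converted PDB should show up as READ WRITE (watch the RESTRICTED column)
select name, open_mode, restricted from v$pdbs;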

Now it was time to move the PDB to the Linux box, which already had a CDB configured as AL32UTF8... And after not finding an answer on whether the FILE_NAME_CONVERT clause can change Windows-style file names to Linux-style ones, I just went into the moveTest.xml file and manually updated all locations to which I had transferred the PDB files (btw, later I tested the clause - and I couldn't make it do what I wanted, so, hint: sometimes the manual approach is your choice ;) )

-- Windows CDB
ALTER PLUGGABLE DATABASE test01 CLOSE;
ALTER PLUGGABLE DATABASE test01 UNPLUG INTO 'F:\moveTest.xml';

-- Linux CDB
CREATE PLUGGABLE DATABASE test01 
using '/home/oracle/test01_updated.xml' -- Linux locations
NOCOPY 
tempfile reuse 
/
ALTER SESSION SET CONTAINER=test01;
ALTER PLUGGABLE DATABASE OPEN;

The last command finished - but with a warning. It was time to review the existing violations:

select *
from pdb_plug_in_violations
where name = 'TEST01'
and status !='RESOLVED'

The only errors I found were patch mismatches - and that was easily fixed by running datapatch -verbose on the Linux side: it rolled back all of the patches that didn't belong within the PDB and applied what was needed. Seemingly the system was ready to operate - I could query data, run some sample procedures... But the application didn't start! After some digging I found a somewhat unexpected reason - even though you can move a Windows PDB to Linux, if you are using PL/SQL objects with native compilation, you have to recompile all of them (well, it makes sense: different platforms = different low-level code!). The following loop did that:

begin
    for c in (
        select owner, name, type
        from dba_plsql_object_settings
        where plsql_code_type != 'INTERPRETED'
        )
    loop
        begin
            -- package/type bodies need "ALTER PACKAGE/TYPE ... COMPILE BODY",
            -- not "ALTER PACKAGE BODY ... COMPILE"
            if c.type in ('PACKAGE BODY', 'TYPE BODY') then
                execute immediate 'alter '||replace(c.type, ' BODY')
                                  ||' '||c.owner||'.'||c.name||' compile body';
            else
                execute immediate 'alter '||c.type
                                  ||' '||c.owner||'.'||c.name||' compile';
            end if;
        exception
            when others then null; -- ignore failures here; invalid objects get reviewed afterwards
        end;
    end loop;
end;
/
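
To make sure the WHEN OTHERS handler didn't silently swallow anything important, a quick inventory of invalid objects is a reasonable follow-up (my own addition, not part of the original run):

select owner, object_type, count(*) cnt
from dba_objects
where status = 'INVALID'
group by owner, object_type
order by owner, object_type;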

That was easy enough - so I decided to replicate exactly the same pattern for the main development database (which was 11.2.0.4). So, in my Test database I dropped the PDB Test01 with all datafiles (I had to do it because the Dev file names overlapped with Test01 and I didn't want to shuffle anything around) and used the good old tried-and-true approach:
  1. Original DB
    • Create PFILE from SPFILE
    • Run preupgrade on the source and follow its recommendations
    • Shutdown the instance
    • Transfer files to the new location preserving the file structure (as for the cold backup):
      • ORADATA\ora11g folder
      • FAST_RECOVERY_AREA\ora11g folder with all corresponding directories
      • PFILE
      • Control files from other locations (btw, you can integrate them into the same folder by modifying PFILE)
  2. Server with 19c
    • Create a new instance (a hedged sketch of these commands follows the list):
      • orapwd file=? password=? format=12 (use a lower format if you want a simple password for now)
      • oradim
      • startup upgrade
    • Upgrade via dbupgrade utility
    • Run post-fixes, including DST upgrade
    • Recompile whatever is needed and resolve obscure cases if they exist (Java classes, SDO_GEOMETRY, etc.)
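
Here is a minimal sketch of the 19c-side commands from that list. The SID, paths, parallel degree and password are illustrative, not the actual values from our environment:

REM password file and Windows service for the transferred instance
orapwd file=%ORACLE_HOME%\database\PWDora11g.ora password=Welcome_1 format=12
oradim -NEW -SID ora11g -STARTMODE manual

REM from SQL*Plus as SYSDBA, using the transferred PFILE:
REM   startup upgrade pfile='D:\oradata\initora11g.ora'

REM then the command-line upgrade driver from the 19c home
dbupgrade -n 4
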
Now I was pretty much exactly where I had started with the Test environment - a non-CDB instance plus an empty container. So, the following steps were exactly the same as above: convert the instance into a PDB, plug it in as Dev01, upgrade, fix the PL/SQL natively compiled objects. That's all, folks - two environments became one on a different platform! After that it was just lift-and-shift to AWS - and we've got our new environment.

Summary: if you can use somebody else's experience - use it! Oracle documentation, especially about upgrades, is huge, but sometimes you may be looking for something very specific, and it may be hard to find exactly your use-case in that superset of knowledge.

P.S. VM snapshots are your saviors (that's why I love the cloud). You can't imagine how many various issues I ran into in the process - and without snapshot recovery it would've taken wa-a-a-a-y longer to do anything meaningful.

Wednesday, May 14, 2025

Oracle Documentation and Dangers of Not Reading It

In addition to yesterday's post - a bit of old-timer knowledge sharing: if you have seen some features for years, you have a tendency to assume how they work. And often enough, to your own surprise, "the owls are not what they seem" (c) :)

My favorite story here is about hierarchical queries with the CONNECT BY clause. For years I was wondering why every generation of developers was having various performance issues writing such queries... Recently, while working on a presentation about recursive structures in the database, I came to the following realization: people look up code snippets without going any further!

Let's take a basic problem based on the HR schema - list all employees working for the specified department in hierarchical order, starting from its manager. A lot of junior developers would write something like this:

select *
from hr.employees
where department_id = 100
start with job_id like '%MGR%' 
connect by prior employee_id = manager_id

The result of the query is functionally correct - so developers start using that pattern... And eventually (when they start working with large data sets!) that code becomes the source of major performance degradation! Here is why - let's take a look at the explain plan:

------------------------------------------------------------------------------
| Id  | Operation                                | Name      | Rows  | Bytes |
------------------------------------------------------------------------------
|   0 | SELECT STATEMENT                         |           |    34 |  4522 |
|*  1 |  FILTER                                  |           |       |       |
|*  2 |   CONNECT BY NO FILTERING WITH START-WITH|           |       |       |
|   3 |    TABLE ACCESS FULL                     | EMPLOYEES |   107 |  7383 |
------------------------------------------------------------------------------
 
Predicate Information (identified by operation id):
---------------------------------------------------
 
1 - filter("EMPLOYEES"."DEPARTMENT_ID"=100)
2 - access("EMPLOYEES"."MANAGER_ID"=PRIOR "EMPLOYEES"."EMPLOYEE_ID")
       filter("JOB_ID" LIKE '%MGR%')

Even though we have lots of indexes on the table, we've got a full table scan (FTS) - and if we look closer, we will notice that our department filter was applied at the very end. It seems that Oracle first walked the whole table hierarchically - and only after that threw out all rows that do not belong to department #100!

And indeed that's exactly what Oracle documentation has been telling us for many years:


Indeed, non-join predicates are evaluated only afterwards. In a lot of cases that makes sense - depending on the conditions. But in THIS case we know something that Oracle is missing: hierarchies never cross departments - we just forgot to tell Oracle about it. So, no wonder the result is less than optimal.

How to fix it? There are two approaches to this problem; both significantly improve performance, but produce slightly different execution plans.

Option #1 is to use in-line views (to be fair, that's my favorite approach, because it makes the code way more readable):

select *
from (select *
      from hr.employees
      where department_id = 100)
start with job_id like '%MGR%'
connect by prior employee_id = manager_id

-------------------------------------------------------------------------------------
| Id  | Operation                               | Name              | Rows  | Bytes |
-------------------------------------------------------------------------------------
|   0 | SELECT STATEMENT                        |                   |     2 |   266 |
|*  1 |  CONNECT BY NO FILTERING WITH START-WITH|                   |       |       |
|   2 |   TABLE ACCESS BY INDEX ROWID BATCHED   | EMPLOYEES         |     6 |   414 |
|*  3 |    INDEX RANGE SCAN                     | EMP_DEPARTMENT_IX |     6 |       |
-------------------------------------------------------------------------------------
 
Predicate Information (identified by operation id):
---------------------------------------------------
 
   1 - access("EMPLOYEES"."MANAGER_ID"=PRIOR "EMPLOYEES"."EMPLOYEE_ID")
       filter("EMPLOYEES"."JOB_ID" LIKE '%MGR%')
   3 - access("DEPARTMENT_ID"=100)

As you can see here, explicitly filtering department #100 in the in-line view immediately dropped the number of processed rows and bytes - exactly as needed.

Option #2 is to tell Oracle to start only from rows that belong to department #100:

select *
from hr.employees
start with job_id like '%MGR%' and department_id = 100
connect by prior employee_id = manager_id

-----------------------------------------------------------------------------------
| Id  | Operation                             | Name              | Rows  | Bytes |
-----------------------------------------------------------------------------------
|   0 | SELECT STATEMENT                      |                   |     7 |   931 |
|*  1 |  CONNECT BY WITH FILTERING            |                   |       |       |
|*  2 |   TABLE ACCESS BY INDEX ROWID BATCHED | EMPLOYEES         |     1 |    69 |
|*  3 |    INDEX RANGE SCAN                   | EMP_DEPARTMENT_IX |     6 |       |
|   4 |   NESTED LOOPS                        |                   |     6 |   492 |
|   5 |    CONNECT BY PUMP                    |                   |       |       |
|   6 |    TABLE ACCESS BY INDEX ROWID BATCHED| EMPLOYEES         |     6 |   414 |
|*  7 |     INDEX RANGE SCAN                  | EMP_MANAGER_IX    |     6 |       |
-----------------------------------------------------------------------------------
 
Predicate Information (identified by operation id):
--------------------------------------------------- 
   1 - access("EMPLOYEES"."MANAGER_ID"=PRIOR "EMPLOYEES"."EMPLOYEE_ID")
   2 - filter("JOB_ID" LIKE '%MGR%')
   3 - access("DEPARTMENT_ID"=100)
   7 - access("connect$_by$_pump$_002"."prior employee_id "="MANAGER_ID")

It has a slightly higher cost and a somewhat more complex execution plan - but either way you still get much better performance than the FTS alternative.

Summary: please beware - solutions that work may not scale if you don't understand what is going on under the hood! Read those manuals, please!

Tuesday, May 13, 2025

Oracle Documentation and Dangers of the AI

Earlier today I presented a webinar for NYOUG on one of my favorite topics - building SQL queries on the fly (btw, if anybody is interested - slides and the video recording will be available a bit later):
https://nyoug.org/event/dynamic-mechanisms-of-query-building/

And during the presentation a recent in-house story came to my mind - based on my own remark. I said something like: "If you are using advanced functionality, you really need to know the features you are planning to use - not just copy/paste code snippets from ChatGPT or any other preferred AI tool."

The statement above is perfectly true - and I even got an illustration (kind of) for it, based on a true "mini-scare" that happened in our office.

Here is the story. For many years a lot of my friends laughed at me when I told them that one of the biggest (government) projects supported by Dulcian is still on Oracle 11.2.0.4. Finally this year our partners responsible for administering the system got it to 19c. All is good - but for some internal reasons the production environment is still running with "compatible=11.2.0.4.0"!

Of course, for us (i.e. the development shop) it is a major restriction - and I've been very vocal about finally moving that parameter to "19.0.0". A couple of weeks ago in an internal communication I got a somewhat panicky email from one of my colleagues: "Depending on how I ask, the AI gives different answers on whether a downgrade is possible! Help!" + a couple of screenshots:

#1 Gemini (Google)




#2 ChatGPT

As you can see, one platform says a clear "NO" and the other one says that it is clearly possible, but there may be some impact that needs to be evaluated.

After hours of research I can say the following - Gemini is closer to the real answer than ChatGPT. Even though there is empirical evidence that in some circumstances (different combinations of versions) it is possible to lower the compatibility parameter, it is strongly not recommended. So much not recommended that the 19c docs explicitly state that you cannot go back to 11.2.0.4 at all:



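For reference, checking the current value and making the (one-way) move forward is trivial - the sketch below is mine, and it assumes a maintenance window and a proper backup first (you cannot even flash back across a COMPATIBLE increase):

-- current setting
show parameter compatible

-- after a proper backup:
alter system set compatible = '19.0.0' scope = spfile;
shutdown immediate
startup
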
Summary: be VERY careful when you use various AI tools - they can be a good starting point, but without going to the core references you cannot be sure that you are getting real answers. Buyers, beware! :)

 

Thursday, May 30, 2024

The Power of Proactive Monitoring. Part 6

And talking about benefits... Sometimes, if you get advanced monitoring features, you can use the provided information in a less-than-expected way. Here is one of my favorite examples.

As a lot of people know, some vendors ship their PL/SQL packages wrapped. Also, as a lot of people know, it is often hard to get vendors to accept that something is wrong with their code (rather than with the way this code is being used).

Imagine you have a slow-running module that takes a table/column and converts it into a formatted CLOB - and before talking to the vendor I want to collect as much data as I can:

SQL> DECLARE
  2    v_cl CLOB;
  3  BEGIN
  4    dbms_hprof.start_profiling ('IO','HProf_Check.txt');
  5    v_cl := wrapped_pkg.f_getdata_cl('object_name','test_tab');
  6    dbms_hprof.stop_profiling;
  7  END;
  8  /


I know that the table has 50K rows - and I see that there are 50K calls to DBMS_LOB.WRITEAPPEND! Yes, I can see what happens inside of the wrapped code - big surprise! Now I have real evidence that the wrapped code is not efficient: whoever wrote it used a row-by-row approach.

As I wrote a long time ago, LOB operations are expensive (even LOB variables above 32K in size are not real variables, but a segment in the TEMP tablespace) and cause a lot of resources to be used. I.e. the original code looks something like this:

FUNCTION f_getData_cl(i_column_tx VARCHAR2, i_table_tx VARCHAR2) RETURN CLOB IS
    v_cl CLOB;
    v_tx VARCHAR2(32767);
    v_cur SYS_REFCURSOR;
BEGIN
    dbms_lob.createTemporary(v_cl,false,dbms_lob.call);
    OPEN v_cur FOR 'SELECT '||
                   dbms_assert.simple_sql_name(i_column_tx)||' field_tx'||
                   ' FROM '||dbms_assert.simple_sql_name(i_table_tx);
    LOOP
        FETCH v_cur into v_tx;
        EXIT WHEN v_cur%notfound;
        dbms_lob.writeAppend(v_cl,length(v_tx)+1,v_tx||'|');
    END LOOP;    
    CLOSE v_cur;
    RETURN v_cl;
END;

But if you use PL/SQL variable as a buffer, you can drastically decrease the number of LOB operations. True, it complicates your code - but gains are huge!

FUNCTION f_getData_cl(i_column_tx VARCHAR2, i_table_tx VARCHAR2) RETURN CLOB IS  
    v_cur SYS_REFCURSOR;
    v_cl CLOB;
    
    v_tx VARCHAR2(32767);
    v_buffer_tx VARCHAR2(32767);
    
    PROCEDURE p_flush IS
    BEGIN
        IF v_buffer_tx IS NOT NULL THEN -- guard: WRITEAPPEND with a NULL amount would fail on an empty buffer
            dbms_lob.writeappend(v_cl,length(v_buffer_tx), v_buffer_tx);
            v_buffer_tx:=null;
        END IF;
    END;
    PROCEDURE p_add (i_tx VARCHAR2) IS
    BEGIN
        IF length(i_tx)+length(v_buffer_tx)>32767 THEN
            p_flush;
            v_buffer_tx:=i_tx;
        ELSE
            v_buffer_tx:=v_buffer_tx||i_tx;
        END IF;
    END;    
BEGIN
    dbms_lob.createTemporary(v_cl,true,dbms_lob.call);
    OPEN v_cur FOR 
        'SELECT '||
            dbms_assert.simple_sql_name(i_column_tx)||' field_tx'||
        ' FROM '||dbms_assert.simple_sql_name(i_table_tx);        
    LOOP
        FETCH v_cur into v_tx;
        EXIT WHEN v_cur%notfound;
        p_add(v_tx||'|');   -- keep the '|' delimiter used by the original version
    END LOOP;
    p_flush;
    
    CLOSE v_cur;
    RETURN v_cl;
END;

On a reasonably large data set that second version will buy back a lot of IO operations - really a lot! I strongly recommend you to run those cases yourself - only then will you believe the performance gains :) But if you are too, well, busy - I will get the stats in the next post of this series :)
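
If you want to try it yourself, a throw-away table along these lines should be enough (the name and shape are my assumption based on the call above, and it presumes ALL_OBJECTS has at least 50K rows):

-- ~50K rows in a single text column: enough to make the row-by-row
-- WRITEAPPEND pattern show up clearly in the profiler report
create table test_tab as
select object_name
from all_objects
where rownum <= 50000;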

Monday, May 27, 2024

The Power of Proactive Monitoring. Part 5

Something worth mentioning while talking about the Oracle Hierarchical Profiler is that its scope is a database session. So, if your middle-tier logic makes multiple database calls within an established connection, you can still benefit from the profiler data. To be fair, it has a number of restrictions, as I show further down, but it can still be useful.

Here is a basic example that shows "both sides of the coin":

String sql =
  "begin dbms_hprof.start_profiling(location=>'IO',filename=>'Case1a.txt'); end;";
CallableStatement call = conn.prepareCall(sql);
call.execute();

PreparedStatement stmt =
  conn.prepareStatement("select listagg(owner_tx,',') within group(order by 1) r1 \n" +
    "from (select distinct scott.f_change_tx(owner) owner_tx\n" +
    "      from scott.test_tab) A ");
stmt.execute();

stmt =
  conn.prepareStatement("select listagg(owner_tx,',') within group(order by 1) \n" +
    "from (select distinct scott.f_change_tx(owner) owner_tx\n" +
    "      from scott.test_tab) B ");
stmt.execute();

sql = "begin dbms_hprof.stop_profiling; end;";
call = conn.prepareCall(sql);
call.execute();

As you can see, between the start and stop calls the code runs two functionally identical queries with a slight modification (so their SQL IDs wouldn't match). Both of those SQL statements would cause 50K function calls. And that scenario is quite probable in real life - reasonably few development environments have established PL/SQL APIs as the communication mechanism between layers (to my huge sorrow - I am a big fan of the "smart DB" approach).

But would they be shown correctly? As I hinted above - not exactly.
Since there is no "root" PL/SQL entry point, HProf gets confused and merges both SQL calls into a single one - you see 100K function calls. I.e. you can get at least some stats on what is going on after the root object is called, but no SQL ID and no clear info about the root element.

Interestingly, the same behavior can be replicated via SQL*Plus:
SQL> exec dbms_hprof.start_profiling 
  2          (location=>'IO',filename=>'Case1b_SQLPlus.txt');
SQL> select listagg(owner_tx,',') within group (order by 1) r1
  2  from (select distinct  scott.f_change_tx(owner) owner_tx
  3      from scott.test_tab a);
...
SQL> select listagg(owner_tx,',') within group (order by 1)
  2  from (select distinct  scott.f_change_tx(owner) owner_tx
  3      from scott.test_tab b);
...
SQL> exec dbms_hprof.stop_profiling;

As you can see, no SQL ID, and all function calls are being merged.

Summary of this part: unless you have PL/SQL APIs you may not enjoy the full power of HProf, but you can still benefit from some of the information it provides.


Friday, May 24, 2024

The Power of Proactive Monitoring. Part 4

In the previous post I introduced one of my favorite performance debugging tools - the Hierarchical Profiler. Since Oracle engineers were proactive enough to instrument the SQL and PL/SQL engines with a clear mechanism for understanding who calls whom inside the database, it is just great that we (as developers) have access to the same level of internal information. That information tells us the complete and true story of what is happening - not guesses, not estimates, but the real fingerprints of every involved element.

Now let's get a bit more real and look at a problem that I bet a lot of people encounter when they start using user-defined PL/SQL functions within SQL - it is a very powerful technique, but it can also cause a lot of damage if used inappropriately / without understanding the internals.

Imagine the following scenario:

  1. Our help desk started to get a lot of performance complaints related to the same module 
  2. The corresponding developer checked the 10046 trace and couldn't find anything suspicious
  3. The ticket was escalated to a senior performance expert (well, me :) )
And the very first thing I noticed about the core query is that it contains a user-defined PL/SQL function. I immediately asked the developer - do you KNOW how many times that function is executed during normal activity? The answer I got was: "Well, it shouldn't be too much..."

Let's pause for a sec. We are so accustomed to the Cost-Based Optimizer performing various miracles that we forget the database engine is only as good as the data we provide it. And it is important to check whether its logic matches ours. Here was a clear case of my developer "assuming" instead of "checking". So, the right solution is to get some hard statistics: I wrapped the suspect SQL call into HProf start/stop (below is a simplified case):

SQL> exec dbms_hprof.start_profiling ('IO', 'HProf_Case1.txt');
SQL> declare
  2      v_tx varchar2(32767);
  3  begin
  4      select listagg(owner_tx,',') within group (order by 1)
  5      into v_tx
  6      from (
  7          select distinct scott.f_change_tx(owner) owner_tx
  8          from scott.test_tab
  9          );
 10  end;
 11  /
SQL> exec dbms_hprof.stop_profiling;

Just to be clear - the table TEST_TAB has 50,000 rows with only 26 distinct owners, and the function F_CHANGE_TX does basic formatting, nothing special... The HProf report surprised everybody - 50K calls to the function!


This means that time was wasted on a very cheap function that was simply fired lots and lots of times, because the original developer just guessed the query behavior. He knew that the function does basic formatting of a repeated value, so the output would always be the same - but he forgot to tell that to Oracle. And without that information the database fired the above-mentioned function for every row!

The solution was pretty simple - the query was rewritten to help the CBO:

SQL> exec dbms_hprof.start_profiling ('IO', 'HProf_Case1_fix.txt');
SQL> declare
  2      v_tx varchar2(32767);
  3  begin
  4      select listagg(owner_tx,',') within group (order by 1)
  5      into v_tx
  6      from (
  7          select  scott.f_change_tx(owner) owner_tx
  8          from (select distinct owner 
  9                 from scott.test_tab)
 10           );
 11  end;
 12  /
SQL> exec dbms_hprof.stop_profiling

And here is the proof: 26 function calls, and overall - 28 times faster!


IMHO, that was a simple and straightforward enough illustration of how easy it is to find a problem if you have the right tool! And it is also a great illustration of the power of knowing something (rather than guessing).
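
And if you want to replicate the numbers, a setup roughly like this should do - TEST_TAB and F_CHANGE_TX here are my stand-ins matching the description above (50K rows, relatively few distinct owners, trivial formatting):

create table scott.test_tab as
select owner, object_name
from all_objects
where rownum <= 50000;

-- "basic formatting, nothing special"
create or replace function scott.f_change_tx(i_tx varchar2) return varchar2 is
begin
    return initcap(replace(i_tx, '_', ' '));
end;
/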
 
Monday, May 20, 2024

The Power of Proactive Monitoring. Part 3

After talking for a while about some abstract ideas, let's get a bit more technical and a bit closer to Oracle. Before we really talk about the "proactive" part, we should be a bit more realistic. In about 90% of cases the need for monitoring is recognized only after something has already gone wrong. As a result, the problem of finding out "who's done it" is complicated by not having that much forensic data to work with.

Luckily, in the Oracle technology space some things are recorded by Oracle background processes no matter what, and ASH / AWR can help at least narrow down the area to dig into (sometimes). But let's not get into the DBA weeds for now and stay in the developer's realm.

Assume you can replicate the problem (that's often a big "if") and assume that you have access to the top level of the spaghetti code you inherited from 30+ years of development. From my experience, the entry point is usually detectable - but nobody living knows what exactly happens inside the "black box".

Luckily #2, Oracle engineers left us instrumentation to access their internal mechanisms - in my case, the calls to SQL and PL/SQL as one program unit calls another. That access is implemented via one of the most powerful (and, unfortunately, underutilized) features - the Oracle Hierarchical Profiler (HProf). That feature has been available since Oracle 11g Release 1 and supersedes the old Oracle Profiler (notice the omission of the word "hierarchical"!). In the last couple of years I made a number of presentations about that product, but I feel that sometimes articles are a bit more helpful (yes, I still can't say "podcast" - sorry, I just can't adjust to the video style). This and the following post(s?) will cover some basic ideas on why YOU should care about HProf.

Let's first define a use-case: the database is doing "something" and that "something" takes too much time. And the key caveat: somebody has indeed proven that it is the database that is slow - not data transmission, not the middle tier, not the client. Remember the classic image that my esteemed guru Dr. Paul Dorsey came up with years ago?


Users are always complaining (that's their nature) - and it is your responsibility as a system architect to understand that there are soooo many places where you can lose critical resources, including time (well, and money!). But only when you think in terms of a logical action (rather than a technical call) can you really tune your system:
  • it doesn't matter how fast your database is if there are tens of thousands of calls from the middle tier to accomplish a single activity - the time will be spent opening/closing sessions. And only if you can hierarchically step up to the point where the activity is initiated can you see the "big picture"
  • it doesn't matter how slow your database is if the time is spent on the client side because of various JavaScript performance issues

But let's assume that you found out you need to dive into the internals of the database. The problem is that database calls are rarely something simple - usually you have a combination of PL/SQL units calling SQL calling PL/SQL units calling other PL/SQL units calling SQL, etc. I.e. you have a hierarchy of database calls. And that's where HProf comes to the rescue.

I will start with the simplest module and the simplest logging implementation (writing logs to the file system rather than storing them in the database):

SQL> CREATE DIRECTORY IO AS 'C:\IO';
SQL> exec dbms_hprof.start_profiling 
                   (location=>'IO',filename=>'HProf.txt');

SQL> DECLARE
  2      PROCEDURE p_doSomething (pi_empno NUMBER) IS
  3      BEGIN
  4          dbms_lock.sleep(0.1);
  5      END;
  6      PROCEDURE p_main IS
  7      BEGIN
  8          dbms_lock.sleep(0.5);
  9          FOR c IN (SELECT * FROM emp) LOOP
 10              p_doSomething(c.empno);
 11          END LOOP;
 12      END;
 13  BEGIN
 14      p_main();
 15  END;
 16  /

SQL> exec dbms_hprof.stop_profiling;

As you can see, nothing fancy: (1) an anonymous block calls a procedure, (2) the procedure wastes half a second and walks a loop over the EMP table, (3) for every row in the EMP table another procedure is called that wastes 0.1 sec. All of the processing is started/stopped via DBMS_HPROF calls, and the results are written into a file, as I said earlier.

If you want to read the raw file - it is pretty simple, but it still doesn't provide any insights:

P#V PLSHPROF Internal Version 1.0
P#! PL/SQL Timer Started
P#C PLSQL."".""."__plsql_vm"
P#X 8
P#C PLSQL."".""."__anonymous_block"
P#X 6
P#C PLSQL."".""."__anonymous_block.P_MAIN"#980980e97e42f8ec #6
P#X 63
P#C PLSQL."SYS"."DBMS_LOCK"::9."__pkg_init"
P#X 7
P#R
P#X 119
P#C PLSQL."SYS"."DBMS_LOCK"::11."SLEEP"#e17d780a3c3eae3d #197
P#X 500373
P#R
P#X 586
P#C SQL."".""."__sql_fetch_line9" #9."4ay6mhcbhvbf2"
P#! SELECT * FROM SCOTT.EMP
P#X 3791
P#R
P#X 17
<<… and so on …>>

P#C is a call, P#R is a return from the subprogram, P#X is elapsed time, etc. But you need an aggregator - the PLSHPROF utility:

C:\Utl_File\IO>plshprof -output hprof_intro HProf.txt
PLSHPROF: Oracle Database 12c Enterprise Edition Release 12.2.0.1.0 - 64bit Production
[8 symbols processed]
[Report written to 'hprof_intro.html']

Now the raw data finally makes sense (the resulting report has a lot of sorting options; I picked the most explicit one):


All of the steps are exactly as I described above - including the SQL ID of the executed query. So, if needed, you have a way to link that report with V$SQLAREA and/or a DBMS_XPLAN analysis.
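
For example, something along these lines (my sketch; the SQL ID is the one visible in the raw trace above) ties the HProf data back to the cursor cache:

-- execution plan for the SQL ID reported by HProf
select * from table(dbms_xplan.display_cursor('4ay6mhcbhvbf2', null, 'TYPICAL'));

-- and the aggregate statistics for the same statement
select sql_id, executions, round(elapsed_time/1e6, 3) elapsed_sec
from v$sqlarea
where sql_id = '4ay6mhcbhvbf2';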

To be continued...