Gene Saro_0118 details


Gene Information

Locus tag: Saro_0118
Symbol: hemE
ID: 3916004
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 119850
End bp: 120875
Gene Length: 1026 bp
Protein Length: 341 aa
Translation table: 11
GC content: 63%
IMG OID: 640442843
Product: uroporphyrinogen decarboxylase
Protein accession: YP_495401
Protein GI: 87198144
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0407] Uroporphyrinogen-III decarboxylase
TIGRFAM ID: [TIGR01464] uroporphyrinogen decarboxylase
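
The coordinate and length fields above agree with each other if the start and end positions are read as inclusive, 1-based coordinates: 120875 - 119850 + 1 = 1026 bp, and 1026 / 3 = 342 codons, one of which is the stop codon, leaving 341 aa. A minimal Python sketch of that check follows. It assumes inclusive coordinates and a single terminal stop codon; the function names are illustrative and do not belong to any database tool.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature given inclusive, 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp):
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # All codons except the terminal stop codon encode one residue each.
    return gene_length_bp // 3 - 1


length_bp = gene_length(119850, 120875)
print(length_bp, protein_length(length_bp))
```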


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCCGGCC CTCTTCTGAA GACGCTCCAG GGTGAGAACA TTTCCCGCCG ACCGATCTGG 
CTCATGCGCC AGGCCGGACG CTATCTGCCC GAGTACCGCG AGCTTCGCGC CGAGAAGGGC
GGCTTCCTCG CGCTGGTCTA CGACACTGAC GCAGCGGCCG AAGTTACCGT GCAGCCGATC
CGTCGTTTCG GCTTCGACGG CGCGATCCTG TTTTCCGACA TCCTGATCGT ACCCTATGCG
ATGGGACAGG ATCTCCAGTT CCTCGCCGGC GAAGGTCCGC ACCTGTCACC ACGCTTGCTC
GACGCCGCGC TGAACAGCCT CGTGGCGGTG CCCGGGCGCC TCTCGCCGAT CTACGAGACG
GTTGCCAAGG TGAAGGCCCA GCTTTCGCCT GAAACCACGC TGCTCGGCTT TGCCGGCAGT
CCGTGGACGG TCGCAACCTA CATGGTGGCC GGCGAAGGCA GCCGTGACCA TCACGATACC
CGCGCGCTTG CCTATCGTGA TCCTTCGGCG TTCCAGGCAA TCATCGATGC GATTACGGAA
GTGACCATCG AGTATCTTTC GGGCCAGGTC GAAGCGGGTG CGGAAGGGCT GCAACTGTTC
GATTCTTGGT CGGGCAGCCT TGCTCCGGCC GAATTCGAAC GTTGGGTCAT CGCGCCCAAC
GCCAGGATCG CCTCCGCGAT GCAGCAGCGT TATCCCCACG TGCCTGTGAT CGGGTTCCCC
AAGGGCGCTG GCGAAAAGCT TTCCGCCTAT GCCCGCGAGA CAGGCGTCAA CGCGGTCGGC
GTGGACGAAA CCATCGATCC GTTATGGGCT GCGCGCGAAC TCCCGGCGAA CATGCCGGTA
CAGGGCAATC TCGATCCGCT TCTGCTCCTT TCGGGCGGCC CTGAGCTGGA ACGGCAGACG
ATCCGTGTTC TCGAAGCCTT TGCCGACCGC CCGCACGTCT TCAATCTTGG CCACGGCATC
GGTCAGCACA CTCCGATCGA AAACGTCGAA GCGCTTCTGA AGATCGTGCG AGGCTGGTCG
CGCTGA
 
Protein sequence
MPGPLLKTLQ GENISRRPIW LMRQAGRYLP EYRELRAEKG GFLALVYDTD AAAEVTVQPI 
RRFGFDGAIL FSDILIVPYA MGQDLQFLAG EGPHLSPRLL DAALNSLVAV PGRLSPIYET
VAKVKAQLSP ETTLLGFAGS PWTVATYMVA GEGSRDHHDT RALAYRDPSA FQAIIDAITE
VTIEYLSGQV EAGAEGLQLF DSWSGSLAPA EFERWVIAPN ARIASAMQQR YPHVPVIGFP
KGAGEKLSAY ARETGVNAVG VDETIDPLWA ARELPANMPV QGNLDPLLLL SGGPELERQT
IRVLEAFADR PHVFNLGHGI GQHTPIENVE ALLKIVRGWS R
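
The sequences above are printed in space-separated groups of 10, so the whitespace has to be removed before they can be analysed. The Python sketch below shows one way to do that and then compute a GC fraction and a translation. All helper names are hypothetical. The codon string encodes the standard amino acid assignments, which translation table 11 shares (table 11 differs in its start codons), and translation here simply stops at the first stop codon. The example runs on the first three groups of the gene sequence only.

```python
# Illustrative helpers; the names are hypothetical, not from any database tool.
BASES = "TCAG"
# Amino acids for all 64 codons, ordered by first, second, third base in
# TCAG order. Translation table 11 uses these same assignments.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)


def clean_sequence(text):
    """Remove spaces and line breaks from a grouped sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of bases in seq that are G or C."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate complete codons from position 0, stopping at the first stop."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        amino_acid = AMINO_ACIDS[index]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three groups of the gene sequence listed above.
first_groups = clean_sequence("ATGCCCGGCC CTCTTCTGAA GACGCTCCAG")
print(len(first_groups))        # 30
print(translate(first_groups))  # MPGPLLKTLQ
```

The translated fragment matches the first group of the protein sequence. Applying the same helpers to the full gene sequence is a way to cross-check the listed gene length, GC content and protein sequence.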