Gene Information |
Locus tag | Saro_0118 |
Symbol | hemE |
ID | 3916004 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 119850 |
End bp | 120875 |
Gene Length | 1026 bp |
Protein Length | 341 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640442843 |
Product | uroporphyrinogen decarboxylase |
Protein accession | YP_495401 |
Protein GI | 87198144 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0407] Uroporphyrinogen-III decarboxylase |
TIGRFAM ID | [TIGR01464] uroporphyrinogen decarboxylase |
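The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions, and that the gene length includes the terminal stop codon (the listed gene sequence does end in TGA). The function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length_bp(119850, 120875)
print(length, protein_length_aa(length))  # prints: 1026 341
```

Both values agree with the Gene Length (1026 bp) and Protein Length (341 aa) rows of the record.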
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCGGCC CTCTTCTGAA GACGCTCCAG GGTGAGAACA TTTCCCGCCG ACCGATCTGG CTCATGCGCC AGGCCGGACG CTATCTGCCC GAGTACCGCG AGCTTCGCGC CGAGAAGGGC GGCTTCCTCG CGCTGGTCTA CGACACTGAC GCAGCGGCCG AAGTTACCGT GCAGCCGATC CGTCGTTTCG GCTTCGACGG CGCGATCCTG TTTTCCGACA TCCTGATCGT ACCCTATGCG ATGGGACAGG ATCTCCAGTT CCTCGCCGGC GAAGGTCCGC ACCTGTCACC ACGCTTGCTC GACGCCGCGC TGAACAGCCT CGTGGCGGTG CCCGGGCGCC TCTCGCCGAT CTACGAGACG GTTGCCAAGG TGAAGGCCCA GCTTTCGCCT GAAACCACGC TGCTCGGCTT TGCCGGCAGT CCGTGGACGG TCGCAACCTA CATGGTGGCC GGCGAAGGCA GCCGTGACCA TCACGATACC CGCGCGCTTG CCTATCGTGA TCCTTCGGCG TTCCAGGCAA TCATCGATGC GATTACGGAA GTGACCATCG AGTATCTTTC GGGCCAGGTC GAAGCGGGTG CGGAAGGGCT GCAACTGTTC GATTCTTGGT CGGGCAGCCT TGCTCCGGCC GAATTCGAAC GTTGGGTCAT CGCGCCCAAC GCCAGGATCG CCTCCGCGAT GCAGCAGCGT TATCCCCACG TGCCTGTGAT CGGGTTCCCC AAGGGCGCTG GCGAAAAGCT TTCCGCCTAT GCCCGCGAGA CAGGCGTCAA CGCGGTCGGC GTGGACGAAA CCATCGATCC GTTATGGGCT GCGCGCGAAC TCCCGGCGAA CATGCCGGTA CAGGGCAATC TCGATCCGCT TCTGCTCCTT TCGGGCGGCC CTGAGCTGGA ACGGCAGACG ATCCGTGTTC TCGAAGCCTT TGCCGACCGC CCGCACGTCT TCAATCTTGG CCACGGCATC GGTCAGCACA CTCCGATCGA AAACGTCGAA GCGCTTCTGA AGATCGTGCG AGGCTGGTCG CGCTGA
|
Protein sequence | MPGPLLKTLQ GENISRRPIW LMRQAGRYLP EYRELRAEKG GFLALVYDTD AAAEVTVQPI RRFGFDGAIL FSDILIVPYA MGQDLQFLAG EGPHLSPRLL DAALNSLVAV PGRLSPIYET VAKVKAQLSP ETTLLGFAGS PWTVATYMVA GEGSRDHHDT RALAYRDPSA FQAIIDAITE VTIEYLSGQV EAGAEGLQLF DSWSGSLAPA EFERWVIAPN ARIASAMQQR YPHVPVIGFP KGAGEKLSAY ARETGVNAVG VDETIDPLWA ARELPANMPV QGNLDPLLLL SGGPELERQT IRVLEAFADR PHVFNLGHGI GQHTPIENVE ALLKIVRGWS R
|
| |
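The sequences above are displayed in blocks of ten characters. As a minimal sketch, the following shows one way to strip that spacing, compute a GC percentage to compare with the GC content row, and translate the coding sequence. It assumes the sequence contains only A, C, G and T. Translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as starts; since this gene begins with ATG, no special start handling is included. All names in the code are hypothetical helpers, not part of the source database.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (identical codon
# assignments in the standard code and in translation table 11).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the display spacing and normalise to upper case."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean_sequence(dna)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Raises KeyError on ambiguous bases (assumption: input is A/C/G/T only).
    """
    dna = clean_sequence(dna)
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three display blocks of the gene sequence from the record.
prefix = "ATGCCCGGCC CTCTTCTGAA GACGCTCCAG"
print(translate(prefix))  # prints: MPGPLLKTLQ
```

The translated prefix matches the first block of the listed protein sequence. Passing the full gene sequence to `gc_percent` and `translate` would allow the same comparison against the GC content and Protein sequence rows.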