Gene Information |
Locus tag | Francci3_1334 |
Symbol | hemE |
ID | 3906547 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 1601250 |
End bp | 1602359 |
Gene Length | 1110 bp |
Protein Length | 369 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 637878667 |
Product | uroporphyrinogen decarboxylase |
Protein accession | YP_480440 |
Protein GI | 86740040 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0407] Uroporphyrinogen-III decarboxylase |
TIGRFAM ID | [TIGR01464] uroporphyrinogen decarboxylase |
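
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers introduced for illustration; they are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon is included."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(1601250, 1602359)
print(length_bp)                  # 1110, matching "Gene Length"
print(protein_length(length_bp))  # 369, matching "Protein Length"
```

Under these assumptions, 1110 bp corresponds to 370 codons: 369 coding codons plus the stop codon.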
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0406905 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTGTCC TCCACGTTGA CGCGCGACCC GGATCCGGGC CGGGTGGCGT CTCGCCCCCG CCGAGCGGTG CAGCGCTCGC CCGGCGGCCC GGCCTGGCCG ACACCGCGCC CTTCCTGCGT GCCTGTCGAC GGGAGCACCC CGGGACCACG CCGGTGTGGT TCATGCGTCA GGCCGGGCGG GTCCTGCCCG AGTACCGGGC CCTGCGCGCG GGAGTCGCCA TGCTCGACTC CTGCCGGGAC GCCGAGATGA TCACGGAGAT CACGCTCCAG CCGGTACGCC GGTTCCGGCC GGACGCGGCG ATCTTCTTCT CCGACATCGT GGTGCCCCTG GTCGCGATCG GCCTCGACAT CGACATCGTC GCCGGGATCG GACCGGTGGT GGCCGAGCCC GTGCGGGACG CCGTCGGGCT CGCCGCGTTG CGTGCACTGG AGCCGGACGA CGTTCCCTAC GTGGCCGACG CGGTCCGGTT CCTGCTGGCC GAGCTGGGTT CAACCCCGCT GATCGGGTTC GCCGGGGCGC CGTTCACCCT CGCGAGCTAC CTCATCGAGG GCGGACCGAG TCGCGACCAC GCCCGCACCA AGGCGTTGAT GTACAGCGAA CCGAAGCTCT GGCACGCCCT GCTGGCCCGG CTCGCCGACA TCACCACCGC CTTCCTGCGC GTCCAGGTGG ATGCCGGTGT TGACGCGCTG CAGCTGTTCG ACTCCTGGGC CGGGGCGCTG GACGAGGCGG ACTACCGTCG CTACGTCGCG CCGCACAGCG CTCGGGTGCT GGCGGCCTTC GCCGGTGAGG TGCCGCGCAT CCACTTCGGT GTGAACACCG GTGAGCTGCT CGCCGCGATG GGCCAGGCGG GTGCGGACGT CGTCGGCGTC GACTGGCGGG TCCCTCTCGA CGAGGCCGCC CGGCGGATCG GGCCCGGTCA TGCCGTGCAG GGAAACCTCG ACCCGACCGC GGTCTTCGCC CCCGAACCGG TGCTCGCCGC CAAGGTGCGC GACGTCTGCG CCCGCGGGGC CGAGGCAGAG GGGCACGTGT TCAACCTCGG CCACGGGGTG CTGCCGCAGA CCGATCCGGG CGTGCTCGCG CACGTCGCCG ACCTTGTCCA CGGCGGATGA
|
Protein sequence | MPVLHVDARP GSGPGGVSPP PSGAALARRP GLADTAPFLR ACRREHPGTT PVWFMRQAGR VLPEYRALRA GVAMLDSCRD AEMITEITLQ PVRRFRPDAA IFFSDIVVPL VAIGLDIDIV AGIGPVVAEP VRDAVGLAAL RALEPDDVPY VADAVRFLLA ELGSTPLIGF AGAPFTLASY LIEGGPSRDH ARTKALMYSE PKLWHALLAR LADITTAFLR VQVDAGVDAL QLFDSWAGAL DEADYRRYVA PHSARVLAAF AGEVPRIHFG VNTGELLAAM GQAGADVVGV DWRVPLDEAA RRIGPGHAVQ GNLDPTAVFA PEPVLAAKVR DVCARGAEAE GHVFNLGHGV LPQTDPGVLA HVADLVHGG
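
The GC content and protein sequence reported above can in principle be recomputed from the gene sequence. The following is a minimal sketch, not the database's own pipeline. It assumes the sequence is given on the coding strand, in frame, with only unambiguous bases (A, C, G, T), and it removes the spaces that separate the 10-base groups. Translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons), so the standard codon-to-amino-acid mapping is used. The names `clean`, `gc_content`, and `translate` are hypothetical helpers.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; amino-acid assignments are
# identical for the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace (e.g. the 10-base grouping) and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str, to_stop: bool = True) -> str:
    """Translate complete codons from position 0.

    Raises KeyError on ambiguous bases such as N (not handled in this sketch).
    """
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence above.
prefix = "ATGCCTGTCC TCCACGTTGA CGCGCGACCC"
print(translate(prefix))  # MPVLHVDARP, the first 10 residues of the protein
print(f"{gc_content(prefix):.0%}")
```

Passing the full Gene sequence field to `gc_content` and `translate` allows the reported GC content and Protein sequence to be compared against the recomputed values.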