Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_5175 |
Symbol | hemE |
ID | 5673509 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 6210539 |
End bp | 6211621 |
Gene Length | 1083 bp |
Protein Length | 360 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641244029 |
Product | uroporphyrinogen decarboxylase |
Protein accession | YP_001509439 |
Protein GI | 158316931 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0407] Uroporphyrinogen-III decarboxylase |
TIGRFAM ID | [TIGR01464] uroporphyrinogen decarboxylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0841992 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCCTCG GAAGTACCGC CGTTGCGCCG CCTCGCCCCG GGGTCTCGCC CCGACGTCCA GGTCTCGCCG CGACTTCGCC GTTCCTGCGC GCCGCCGCCG GGGACCGCCC CGACAGCGTG CCCGTGTGGT TCATGCGGCA GGCCGGCCGG GTGCTTCCCG AGTACCGCGC CCTGCGCGCC ACCACCGCCA TGCTCGATTC CTGCCGCAAC GCCGACATGG TCACCGAGAT CACCCTGCAG CCGGTGCGCC GTTTCCGGCC CGACGCCGCG ATCTTCTTCT CGGACATCGT CCTGCCGCTC GCGGCCGTCG GGGTGGACGT CGACATCGTC GCCGGGGTCG GGCCCGTGGT GGCCCATCCC GTCCGGGCGC CGTCCGACCT GGACGTGCTG CGCCCGCTGG AGCCCGGTGA CGTGCCCTAC GTGAGCGAGG CGGTGGCCTC CCTGGTCCGC GAGCTCGGGC AGACGCCGCT GATCGGTTTC GCCGGCGCGC CGTTCACCCT GGCCAGCTAT CTGATCGAAG GCGGGCCGAG CCGCAACCAC ACCCGCACGA AGGCGCTCAT GTACGCCGAG CCGGCGCTGT GGCACGACCT GCTCGGCCGG CTCGCCGACA TCACGGCCGC CTTCCTGCGG GTGCAGGTCG ACGCCGGCGC CGACGCCATC CAGCTGTTCG ACTCGTGGGC GGGCGCGCTC AGCGAGGACG ACTACCTGCG CTACGTGGCT CCGCACAGCA CCCGGGTTCT CGCGGCGTTC GCCGACGACG GCATCCCGCG CATCCACTTC GGGGTGAACA CCGGGGAGCT GCTCGGCGCG ATGGGCGCGG CCGGCGCGGA CGTCGTCGGG GTCGACTGGC GCGTCCCGCT GGACGAGGCC GCCCGCCGGG TCGGCCCCGG TCGTGCCGTG CAGGGCAACC TCGACCCGGC CGCGGTCTTC GCCCCGTCCG ACGTGCTCGC GGCGAAGGTC CGCGACGTCT GCCGCCGGGG TGCGGCCGCT CCCGGGCACA TCTTCAACTT CGGGCACGGC GTGCTTCCGG AGAGTGATCC GGGCGTGCTG GCGCACATCG TGGACCTCGT CCACCAGTTC TGA
|
Protein sequence | MSLGSTAVAP PRPGVSPRRP GLAATSPFLR AAAGDRPDSV PVWFMRQAGR VLPEYRALRA TTAMLDSCRN ADMVTEITLQ PVRRFRPDAA IFFSDIVLPL AAVGVDVDIV AGVGPVVAHP VRAPSDLDVL RPLEPGDVPY VSEAVASLVR ELGQTPLIGF AGAPFTLASY LIEGGPSRNH TRTKALMYAE PALWHDLLGR LADITAAFLR VQVDAGADAI QLFDSWAGAL SEDDYLRYVA PHSTRVLAAF ADDGIPRIHF GVNTGELLGA MGAAGADVVG VDWRVPLDEA ARRVGPGRAV QGNLDPAAVF APSDVLAAKV RDVCRRGAAA PGHIFNFGHG VLPESDPGVL AHIVDLVHQF
|
| |