Gene Information |
Locus tag | Franean1_2232 |
Symbol | hemH |
ID | 5670631 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 2669519 |
End bp | 2670784 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641241152 |
Product | ferrochelatase |
Protein accession | YP_001506573 |
Protein GI | 158314065 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0276] Protoheme ferro-lyase (ferrochelatase) |
TIGRFAM ID | [TIGR00109] ferrochelatase |
|
|
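The coordinate and length fields above are related by simple arithmetic. The sketch below shows that relation, assuming 1-based inclusive coordinates and a coding sequence that ends in a single stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end positions."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length = gene_length_bp(2669519, 2670784)
print(length)                     # 1266
print(protein_length_aa(length))  # 421
```

Both results match the Gene Length and Protein Length fields of the record.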
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0180991 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00658585 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGGAAGGGG ACATGACGGT GCCGACGGAC ACCGCCGGCC ACGGCCGCGG GATCGACGTG GACGCGCTGC TGCTCGTCTC CTTCGGCGGT CCGGAGGGGC CGGAGGACGT CCTGCCGTTC CTGCGCAACG TGACCCGCGG CCGCGGGGTG CCCGAGGCGC GGCTGGCCGA GGTCGCGGCG CACTACGACC GCTTCGGCGG GCGCAGCCCG ATCAACGACC AGAACCGGGC GCTGCTCGCC GCGCTGCGGG AGCGGCTGGC CCCGATGCCC GTGTACTGGG GCAACCGGAA CTGGCAGCCG TACCTCGCCG ACGCCGTGGC GCGCATGCGG GCCGACGGCG TGCGCCGGGC GGCCTGCTTC GTGACGTCCG CGTTCGCGTC CTACTCGGGG TGCCGGCAGT ACCGGGAGGA TCTGGCGGCG GCCCTGGAGC AGGTCGGCCC GGGCGCCCCC GACCTGGTGA AGCTGCGGCT GTTCTTCGAC CACCCGGGAT TCGTCGAGCC GATGGTCGAC CACTGCGTGC GGGCGCTCGC GTCGCTGCCG GCGGCGGTGC GCGACGAGGC CCGGCTGGTG TTCACCGCCC ATGCGCTGCC CCGCTCGCAG GCGGCCGCGA GCGGGCCGGA CGGCGGAGCC TACGAGCGCC AGCTCCGCGC GGCCGCCGGT GTGATCGCGC AGCGGGTCGC CGCGCGCGCC GGTACCCGGC ACGAGTGGCA GGTCGCCTAC TGCAGCCGCA GCGGCCCGCC GAGCGTGCCC TGGCTCGAAC CGGACGTCAA CGACGCCCTG GCGGCGCTCG CGGACGGCGG CGCGCGCGCC GCGGTGATCG TGCCGGTCGG TTTCGTCAGC GACCACATGG AGGTCGTCTA CGATCTCGAC GTCGAGGCGG TGCGCACCGC CTCGGAACGC GGTCTGGCGG TCGCCCGGGC GGGCACCGTC GGGACGGACC CGCGCTTCGT CGGGATGGTG GCCGATCTGG TCGCCGAGCG CCGCTTCCCC GAGTGGGAGC GGCCCGCGCT GAGCGGCGAG GGGCCGTCCC ACGACGTCTG CCCGCTGCAC TGCTGTGACC CGGGTGCCCG CCGCCCGGCG GCGGCCGGTG TTCCGGCGGA TCTCTGCGCG CACGGCCCGG CACGTTCCGG CGCGACGGCG TCACAGCCGG GCAGGCCGAC GAACGCGGAG CGCTCCCGGG ATGGTGAACC GCGCAACGAA GCGACGATAC TGGTGGATGA GCGGGCAGGT ATCTCGCGGT CCGACTCTCA CCCCGCAGGC GAATAA
|
Protein sequence | MEGDMTVPTD TAGHGRGIDV DALLLVSFGG PEGPEDVLPF LRNVTRGRGV PEARLAEVAA HYDRFGGRSP INDQNRALLA ALRERLAPMP VYWGNRNWQP YLADAVARMR ADGVRRAACF VTSAFASYSG CRQYREDLAA ALEQVGPGAP DLVKLRLFFD HPGFVEPMVD HCVRALASLP AAVRDEARLV FTAHALPRSQ AAASGPDGGA YERQLRAAAG VIAQRVAARA GTRHEWQVAY CSRSGPPSVP WLEPDVNDAL AALADGGARA AVIVPVGFVS DHMEVVYDLD VEAVRTASER GLAVARAGTV GTDPRFVGMV ADLVAERRFP EWERPALSGE GPSHDVCPLH CCDPGARRPA AAGVPADLCA HGPARSGATA SQPGRPTNAE RSRDGEPRNE ATILVDERAG ISRSDSHPAG E
|
| |
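The gene sequence above is printed in space-separated blocks of 10 bases, so the spaces have to be removed before any computation. The sketch below shows one way to compute a GC fraction comparable to the GC content field. It is a minimal illustration, assuming the field is the share of G and C bases over the whole gene; the function name is hypothetical, and only the first three blocks of the sequence are used here.

```python
def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases in a sequence, ignoring whitespace and case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First three 10-base blocks of the gene sequence in the record.
prefix = "ATGGAAGGGG ACATGACGGT GCCGACGGAC"
print(f"GC fraction of prefix: {gc_fraction(prefix):.2f}")
```

Passing the full Gene sequence field to `gc_fraction` gives a value that can be compared with the reported GC content.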