Gene Franean1_2232 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2232 
SymbolhemH 
ID5670631 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2669519 
End bp2670784 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content75% 
IMG OID641241152 
Productferrochelatase 
Protein accessionYP_001506573 
Protein GI158314065 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0276] Protoheme ferro-lyase (ferrochelatase) 
TIGRFAM ID[TIGR00109] ferrochelatase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0180991 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00658585 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGGAAGGGG ACATGACGGT GCCGACGGAC ACCGCCGGCC ACGGCCGCGG GATCGACGTG 
GACGCGCTGC TGCTCGTCTC CTTCGGCGGT CCGGAGGGGC CGGAGGACGT CCTGCCGTTC
CTGCGCAACG TGACCCGCGG CCGCGGGGTG CCCGAGGCGC GGCTGGCCGA GGTCGCGGCG
CACTACGACC GCTTCGGCGG GCGCAGCCCG ATCAACGACC AGAACCGGGC GCTGCTCGCC
GCGCTGCGGG AGCGGCTGGC CCCGATGCCC GTGTACTGGG GCAACCGGAA CTGGCAGCCG
TACCTCGCCG ACGCCGTGGC GCGCATGCGG GCCGACGGCG TGCGCCGGGC GGCCTGCTTC
GTGACGTCCG CGTTCGCGTC CTACTCGGGG TGCCGGCAGT ACCGGGAGGA TCTGGCGGCG
GCCCTGGAGC AGGTCGGCCC GGGCGCCCCC GACCTGGTGA AGCTGCGGCT GTTCTTCGAC
CACCCGGGAT TCGTCGAGCC GATGGTCGAC CACTGCGTGC GGGCGCTCGC GTCGCTGCCG
GCGGCGGTGC GCGACGAGGC CCGGCTGGTG TTCACCGCCC ATGCGCTGCC CCGCTCGCAG
GCGGCCGCGA GCGGGCCGGA CGGCGGAGCC TACGAGCGCC AGCTCCGCGC GGCCGCCGGT
GTGATCGCGC AGCGGGTCGC CGCGCGCGCC GGTACCCGGC ACGAGTGGCA GGTCGCCTAC
TGCAGCCGCA GCGGCCCGCC GAGCGTGCCC TGGCTCGAAC CGGACGTCAA CGACGCCCTG
GCGGCGCTCG CGGACGGCGG CGCGCGCGCC GCGGTGATCG TGCCGGTCGG TTTCGTCAGC
GACCACATGG AGGTCGTCTA CGATCTCGAC GTCGAGGCGG TGCGCACCGC CTCGGAACGC
GGTCTGGCGG TCGCCCGGGC GGGCACCGTC GGGACGGACC CGCGCTTCGT CGGGATGGTG
GCCGATCTGG TCGCCGAGCG CCGCTTCCCC GAGTGGGAGC GGCCCGCGCT GAGCGGCGAG
GGGCCGTCCC ACGACGTCTG CCCGCTGCAC TGCTGTGACC CGGGTGCCCG CCGCCCGGCG
GCGGCCGGTG TTCCGGCGGA TCTCTGCGCG CACGGCCCGG CACGTTCCGG CGCGACGGCG
TCACAGCCGG GCAGGCCGAC GAACGCGGAG CGCTCCCGGG ATGGTGAACC GCGCAACGAA
GCGACGATAC TGGTGGATGA GCGGGCAGGT ATCTCGCGGT CCGACTCTCA CCCCGCAGGC
GAATAA
 
Protein sequence
MEGDMTVPTD TAGHGRGIDV DALLLVSFGG PEGPEDVLPF LRNVTRGRGV PEARLAEVAA 
HYDRFGGRSP INDQNRALLA ALRERLAPMP VYWGNRNWQP YLADAVARMR ADGVRRAACF
VTSAFASYSG CRQYREDLAA ALEQVGPGAP DLVKLRLFFD HPGFVEPMVD HCVRALASLP
AAVRDEARLV FTAHALPRSQ AAASGPDGGA YERQLRAAAG VIAQRVAARA GTRHEWQVAY
CSRSGPPSVP WLEPDVNDAL AALADGGARA AVIVPVGFVS DHMEVVYDLD VEAVRTASER
GLAVARAGTV GTDPRFVGMV ADLVAERRFP EWERPALSGE GPSHDVCPLH CCDPGARRPA
AAGVPADLCA HGPARSGATA SQPGRPTNAE RSRDGEPRNE ATILVDERAG ISRSDSHPAG
E