Gene Mlg_0737 details


Gene Information

Locus tag: Mlg_0737
Symbol:
ID: 4270498
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 821837
End bp: 822958
Gene Length: 1122 bp
Protein Length: 373 aa
Translation table: 11
GC content: 67%
IMG OID: 638125486
Product: ferrochelatase
Protein accession: YP_741581
Protein GI: 114319898
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0276] Protoheme ferro-lyase (ferrochelatase)
TIGRFAM ID: [TIGR00109] ferrochelatase
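
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the record; variable names are illustrative.
start_bp = 821837
end_bp = 822958
gene_length_bp = 1122
protein_length_aa = 373

# Assumption: coordinates are 1-based and inclusive.
span = end_bp - start_bp + 1
codons = span // 3

assert span == gene_length_bp           # 1122 bp
assert span % 3 == 0                    # whole number of codons
# Assumption: the last codon is a stop codon, not a residue.
assert codons - 1 == protein_length_aa  # 374 codons -> 373 aa

print(span, codons - 1)  # 1122 373
```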


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 46
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGCAGT TGACCGTGAA GTCGTTGTTC GAAGGCCGGA AGGGGTTCCG GCACGACGAT 
CAGCCCCGGC TCGGGGTGCT GGTAACCAAC CTGGGTACCC CTGACGCCCC TACCACGCCG
GCCCTGCGCC GCTACCTGCA CGAGTTCCTG TGGGACCCCC GGGTGGTGGA GGCGCCCCGT
TGGATCTGGT GGCTGATCCT CAACGGCATT GTGTTGCGTA CCCGCCCGAA GAAATCCGCC
GCGGCCTACC GTGAGGTCTG GACCGAAGAG GGCTCGCCGC TGCTGATCAT CGGCCGGAAA
CAGACCCGGG GCATCCTCGA GCGGTTGCAG TCGCGCCTGC AGGGGCCCGT GGTGGCGGAG
CTGGCCATGC GCTACGGCAA CCCTTCCATC GCCTCCGGAC TGCGCAGGCT GCGCGATCAG
GGAGCAGAGC GCATCGTGGT GCTGCCGCTC TATCCCCAGT ATTCCGGCTC CACGACCGGG
TCCACCTTCG ATGCCGTTGC GGACGAGCTC AAGCGCTGGC GGTGGGTGCC GGAGTTCCGC
TTCATCGGCC AGTACCACGA CGATGAGCGT TACATCGAGG CCCTGGCCGC CAGCATCCGC
GAGCACTGGG CCGAGCACGG CCGCGGCGAG AAGCTGCTGT TTTCCTTCCA CGGGACCCCG
CGCCGCTACC TGCTCGACGG CGACCCCTAT CACTGCCAGT GTCAGAAGAC GGCCCGCCTG
GTGGCCGAGC GCCTGGAGCT GTCCGACGAG GCCTGGCAAG TCACCTTCCA GTCGCTCTTT
GGCAAGGAGG TCTGGCTGCA GCCCTACACC GATGCCACCG TGGAGCAGCT GGCCCGCTCG
GGGCTGAAGA CCCTGGACGT GATCTGCCCG GGTTTCTCGG CGGACTGCCT GGAGACCCTA
GAGGAGATCG AGGGCGAGAA TGCCGAGATC TTCCAGGAGC ACGGCGGCGA TAAGCTGCGC
TACATCAAGG CGTTGAACGA CCGGGACGAC CACTTGGAGA TGCTGGCCGG CCTGGTGCAT
GAGCATAGCC AGGGCTGGCC GGAGGCTGGG GGGCCGGCGC GCACCCTGCG CGACCCGCAG
GCCACCCAAG AGCGGGCCAG GGCACTGGGG TCCGATGTCT GA
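
The sequence is printed in blocks of 10 bases separated by spaces, so it needs to be joined before any calculation. As a minimal sketch, the GC content reported for the gene could be recomputed as below; `gc_fraction` is a hypothetical helper, not part of any database tooling, and the example runs only on the first 10-base block rather than the full gene.

```python
def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)

# First 10-base block of the gene sequence (5 of 10 bases are G or C).
first_block = "ATGTCGCAGT"
print(f"{gc_fraction(first_block):.0%}")  # 50%
```

Passing the full gene sequence, with its spaces and line breaks left in, would give the whole-gene value to compare against the GC content field.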
 
Protein sequence
MSQLTVKSLF EGRKGFRHDD QPRLGVLVTN LGTPDAPTTP ALRRYLHEFL WDPRVVEAPR 
WIWWLILNGI VLRTRPKKSA AAYREVWTEE GSPLLIIGRK QTRGILERLQ SRLQGPVVAE
LAMRYGNPSI ASGLRRLRDQ GAERIVVLPL YPQYSGSTTG STFDAVADEL KRWRWVPEFR
FIGQYHDDER YIEALAASIR EHWAEHGRGE KLLFSFHGTP RRYLLDGDPY HCQCQKTARL
VAERLELSDE AWQVTFQSLF GKEVWLQPYT DATVEQLARS GLKTLDVICP GFSADCLETL
EEIEGENAEI FQEHGGDKLR YIKALNDRDD HLEMLAGLVH EHSQGWPEAG GPARTLRDPQ
ATQERARALG SDV
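
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below builds a codon table from the standard amino acid assignments, which translation table 11 shares with the standard code (table 11 differs in its permitted start codons). The names `CODON_TABLE` and `translate` are illustrative, and the examples use only the first three blocks and the last line of the gene sequence.

```python
from itertools import product

# Codons in TCAG order paired with the standard amino acid assignments;
# "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate DNA in frame 1, ignoring whitespace and any trailing partial codon."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))

# First three blocks of the gene sequence.
print(translate("ATGTCGCAGT TGACCGTGAA GTCGTTGTTC"))  # MSQLTVKSLF

# Last line of the gene sequence, which ends in the stop codon TGA.
print(translate("GCCACCCAAG AGCGGGCCAG GGCACTGGGG TCCGATGTCT GA"))  # ATQERARALGSDV*
```

Both outputs match the start and end of the protein sequence listed above, apart from the trailing `*` for the stop codon.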