Gene Mlg_1372 details


Gene Information

Locus tag: Mlg_1372
Symbol:
ID: 4268135
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 1571137
End bp: 1572573
Gene Length: 1437 bp
Protein Length: 478 aa
Translation table: 11
GC content: 63%
IMG OID: 638126128
Product: oxygen-independent coproporphyrinogen III oxidase
Protein accession: YP_742211
Protein GI: 114320528
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0635] Coproporphyrinogen III oxidase and related Fe-S oxidoreductases
TIGRFAM ID: [TIGR00538] oxygen-independent coproporphyrinogen III oxidase
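
The start and end coordinates, gene length, and protein length listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1571137
end_bp = 1572573
gene_length_bp = 1437
protein_length_aa = 478

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end_bp - start_bp + 1

# Assumption: the coding sequence ends with one stop codon,
# which does not contribute an amino acid to the protein.
total_codons = span // 3
coding_codons = total_codons - 1

print("span (bp):", span)
print("codons including stop:", total_codons)
print("codons excluding stop:", coding_codons)
```

Under these assumptions the span equals the listed gene length, and the codon count excluding the stop equals the listed protein length.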


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.292242
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 43
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCCCGTGC CACACATCCC CAATCTCGGC ATTGTGCGAT TTGTCATGGA CCAAACACTG 
CAATTCGACC CGGCCATCCT GGCCAAGTAT GACGTCAGCG GCCCCCGGTA CACCTCTTAC
CCCACGGCTC CGCAGTTCCA CGAGGGCTTC GACGACAAGG CCTATGCCGA GGTGGCCGCG
CTCAGCAACG AGGATCCCAT CCCGCGGCCG CTTTCGCTGT ACGTGCACGT CCCCTTCTGC
GACACCGTCT GCTTCTACTG TGCGTGCAAC AAGATCATCA CCGGTAACTA CTCCCGCGCC
GGTACCTACC TGGACTACCT CGAGAAAGAG GTTGCCCTGC AGAGCCAGCT GTTCGACGCC
GACCGGCGGG TGGAGCAACT GCACTTCGGG GGCGGCACCC CGACCTACCT CAGTGACGAG
GATCTCATCC GGGTAATGGA CATGCTCTCC CGCTATTTCA CCCTCGAGCG GGGGCCGCAA
CGGGAGTTCT CTATCGAGAT CGACCCGCGG GCCGTGCGGG AGACCACGAT CGAGCTGCTG
GCCAAGCTGG GCTTCAACCG CATGAGCGTG GGGGTGCAGG ACTTCGATCC CGCGGTGCAA
AAGGCCGTGA ACCGGATCCA GCCCTACGAG ACCACCGCCC GGGTGATCGC AAAGGCGCGT
AGCTGCGGAT TCCGTTCCAC CAACCTCGAC CTGATTTATG GGCTGCCGCT GCAGAGCGTA
GAGACCTACT CCCGGACCCT GGATCAGACC CTGGAACTGC GCCCCGAGCG CCTGGCGGTC
TACAATTACG CCCATCTACC CCACCTGTTC AAAGTCCAGC GCCAGATCCG CGAGGAGCAG
CTCCCGGGGC CCGACGAAAA GCTGGCCATC CTGGAGCTGA CCATCCGCCG GCTAACCGAG
GCCGGCTATG TCTACATCGG CATGGACCAC TTCGCCCTGC CGGAGGACGA GCTCGCCCAG
GCCCAGCGCG CCGGGACGCT GCACCGCAAT TTCCAGGGCT ACTCTACCCG TGCGGAGTGC
GACTTGGTGG GCCTGGGGGT CACCTCCATT GGCAAGGTGG GCGAGAGCTA CAGTCAGAAC
CTGCGTGATA TGGAGGCCTA CTACGCCCGC CTGGATGAAG GCCGGTTGCC GGTCTTCCGG
GGCGTGGAGC TCAGCGCCGA CGATCAACTG CGCCGCGACG TGATCACTGA GATCATGTGC
CACTCCCGGG TGGACTTCCG GGAGATGGAG GAGCGTTACC AGATCCACTT CCGGGACTAC
TTCCGCGACG CCCTGGAACG GCTGCAGGGG ATGGAGTCGG ATGGTTTGGT CCGAATCGAG
AGCGACCGGT TACAGGTCCT CCCCCGAGGG CGTTTGCTGC TGCGCCACGT GGCCATGGCC
TTCGACGCCT ACCTGAATGC CGACAACGGC AAAACCCGCT ACAGTAAGGT CATATAG
 
Protein sequence
MPVPHIPNLG IVRFVMDQTL QFDPAILAKY DVSGPRYTSY PTAPQFHEGF DDKAYAEVAA 
LSNEDPIPRP LSLYVHVPFC DTVCFYCACN KIITGNYSRA GTYLDYLEKE VALQSQLFDA
DRRVEQLHFG GGTPTYLSDE DLIRVMDMLS RYFTLERGPQ REFSIEIDPR AVRETTIELL
AKLGFNRMSV GVQDFDPAVQ KAVNRIQPYE TTARVIAKAR SCGFRSTNLD LIYGLPLQSV
ETYSRTLDQT LELRPERLAV YNYAHLPHLF KVQRQIREEQ LPGPDEKLAI LELTIRRLTE
AGYVYIGMDH FALPEDELAQ AQRAGTLHRN FQGYSTRAEC DLVGLGVTSI GKVGESYSQN
LRDMEAYYAR LDEGRLPVFR GVELSADDQL RRDVITEIMC HSRVDFREME ERYQIHFRDY
FRDALERLQG MESDGLVRIE SDRLQVLPRG RLLLRHVAMA FDAYLNADNG KTRYSKVI
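
The gene sequence begins with GTG while the protein begins with M. This is consistent with translation table 11, in which GTG is one of the alternative initiation codons and is read as methionine at the start of a coding sequence. The sketch below illustrates this on the first 30 and last 27 bases of the gene sequence. It is a hand-rolled illustration, not the database's own pipeline: the `translate` helper and its `first_is_start` parameter are hypothetical names, and the codon table is built from the standard compact amino-acid string in T, C, A, G order.

```python
from itertools import product

# Standard codon assignments in T, C, A, G order (shared by tables 1 and 11).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

# Initiation codons of translation table 11; at position 0 they code for M.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(dna, first_is_start=True):
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Hypothetical helper for illustration; spaces in the input are ignored.
    """
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and first_is_start and codon in START_CODONS:
            aa = "M"  # an initiator codon is read as methionine
        if aa == "*":
            break  # stop codon: end of the protein
        protein.append(aa)
    return "".join(protein)


# First 30 and last 27 bases, copied from the gene sequence above.
head = "GTGCCCGTGC CACACATCCC CAATCTCGGC"
tail = "AAAACCCGCT ACAGTAAGGT CATATAG"

print(translate(head))                        # MPVPHIPNLG
print(translate(tail, first_is_start=False))  # KTRYSKVI
```

The two printed fragments match the first ten and last eight residues of the protein sequence listed above.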