Gene Mlg_2448 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_2448 
Symbol 
ID4268754 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp2781953 
End bp2783149 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content72% 
IMG OID638127206 
Productcoproporphyrinogen III oxidase, anaerobic 
Protein accessionYP_743278 
Protein GI114321595 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0635] Coproporphyrinogen III oxidase and related Fe-S oxidoreductases 
TIGRFAM ID[TIGR00539] putative oxygen-independent coproporphyrinogen III oxidase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value0.403351 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAACCCGC AGGCCGGCAC CCCCCCGCCC CCGCTCGGGG TCTACCTGCA CCTGCCCTGG 
TGCGTGCAGA AATGCCCCTA TTGCGATTTC AACAGCCACG CCCCGGCCCG CGCGGACCGG
CGCGCTGATG CCAGCACCCT GCCCGCCATT CCCCACGAGC GCTACACCCG CGCCGTGCTG
ACGGACCTGG CCAGCGCCGC CGGGCCGCTC CAGGGCCGGC GCGTGGAGAC GGTCTTCATT
GGCGGCGGCA CCCCCAGTCT GTTCCCGCCC GAGGCCATCG GCGGGTTGCT GGAGGCGCTG
GACCGGCGCC TGGGTCTGAC CGGCGATGCG GAGATCACCC TCGAGGCGAA TCCGGGCACC
GTGGAGCAGG GTCGGTTCCA CGGCTACCGG GCGGCCGGGG TCAACCGGCT TTCCATCGGG
GTGCAAAGTT TCGATGCCGG CGCGCTGCGC CGTCTGGGCC GCATTCATGG CCCGGAGGAG
GCCCGCCGGG CGGTGCGGGC GGCACGACGG GCTGGCTTCC GGCGTATCAA CCTGGACCTG
ATGTACGCCC TGCCCGGCCA GACCACGGCC CAGGCCCTGG CGGACGTGGA GGCCGCCCTG
GCGCTGGCGC CGGAGCACAT CTCCCACTAC CAGCTCACCC TGGAGCCGGG CACGCCCTTC
CACAGCCGCC CGCCGGCGGA TCTGCCGGAC GAGGCCCGGC TGCTGGCGCT GGAGGCGGCA
TGCCGTGAGC GTCTGGCCGC CGCCGGACTG ACCCGTTACG AGGTCTCGGC CTGGGCGCAC
CCGGGCGAGG CCTGCCGCCA CAACCTCAAT TACTGGCGCT TCGGCGATTA CCTGGGCTTG
GGCGCCGGGG CCCACGGCAA GCTGAGCGAC CCGGCCCGGG ACGAGATCCG CCGTGAGGCC
CGGGTGCGCA TGCCGGGCAC CTATATGGCA CAGGCGGGCA CGCCGGCTGC GATCGCTGAA
CTCCGGCGGC TGCAAAACGG CGATATCGTG CTTGAATTCA TGATGAACGC CCTGCGTCTG
GCAGAGGGCT TTCACCGCGA CGACTGGCGG CGACACACCG GCCGCCCGAC CACCCTGTTT
GAGGACCGGG TGGCCGAGGC GGTGACAGAC GGCCTGCTGA GCGATGACGG GGGGCGCATC
CGGCCCACTC AACGGGGCTG GCAACTGCTG GACGGCCTGC TGCAACGGTT CTTGTGA
 
Protein sequence
MNPQAGTPPP PLGVYLHLPW CVQKCPYCDF NSHAPARADR RADASTLPAI PHERYTRAVL 
TDLASAAGPL QGRRVETVFI GGGTPSLFPP EAIGGLLEAL DRRLGLTGDA EITLEANPGT
VEQGRFHGYR AAGVNRLSIG VQSFDAGALR RLGRIHGPEE ARRAVRAARR AGFRRINLDL
MYALPGQTTA QALADVEAAL ALAPEHISHY QLTLEPGTPF HSRPPADLPD EARLLALEAA
CRERLAAAGL TRYEVSAWAH PGEACRHNLN YWRFGDYLGL GAGAHGKLSD PARDEIRREA
RVRMPGTYMA QAGTPAAIAE LRRLQNGDIV LEFMMNALRL AEGFHRDDWR RHTGRPTTLF
EDRVAEAVTD GLLSDDGGRI RPTQRGWQLL DGLLQRFL