Gene Mlg_2668 details


Gene Information

Locus tag: Mlg_2668
Symbol:
ID: 4268801
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 3020731
End bp: 3021942
Gene Length: 1212 bp
Protein Length: 403 aa
Translation table: 11
GC content: 72%
IMG OID: 638127427
Product: protein of unknown function DUF513, hemX
Protein accession: YP_743498
Protein GI: 114321815
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG2959] Uncharacterized enzyme of heme biosynthesis
TIGRFAM ID:
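
The gene length and protein length listed above can be cross-checked against the start and end coordinates. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 3020731
end_bp = 3021942

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# One codon per residue; the last codon is assumed to be a stop codon,
# which contributes no residue to the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the 1212 bp and 403 aa reported in the record.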


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.0000000148689
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCAGGAAA ATAAACCCGA GCGGGAAGAG GAGAAGCCCC GGTCTGAGAA GACCGGCGGC 
GAGGACCCCC AGGGCCAGGA GCTGACGGCC TCTGCGGCCG CGCCCGAGCC GGGCAAGGGG
GGCGGCGGTA CGCCGCCGGC CGGCGGGGAC GGCGACGGTA CCGACAAGGA TCGCCAGGGC
GGGCCCTGGA AACAGGTCGT GGCGTTGCTG GTCGTGGTCC TGGTGCTGGG TGCCGCGGCC
ACCTGGTGGC TGACCGGCGA GATGCGCGAG TTGCGCGCCG AGCAGGCGCG TATGGTCAGT
GCCGACCGGC TGGATGAACG CAGCGACGCC CTCGAGCGGC AACTGGCACG GCTCGAGGAT
CGCGTCACCG ATACCGGCGA GCGCGCCGCC TCCGCCCGTG AGCGGGCCGA CGAGGCCGGC
GATGCCCTGG GGACCCTGCG CGAACAGCTG GACGAGTTGC GCGCCCGCCA GGGCGGTTTC
GAGGAGGGCC TGGAGCGACT GGGTGCGCGC GCCGAGGCCA ACCGGGAGAA CTGGATCCGT
TCCGAGGCGG CCTACCTGGC CACCGTGGCC GTCCACCGGA TGCGCTTCCA CCGCGACCCC
AAGACCGCAC TTGGCGCCCT GCAGGCAGCA GACAAGCTGA TGGCCGACAT CGGTGCCAGC
GAGAGCGTGC CGGCTCGCGT TGCCCTCAAC GAGGCGGTCA CCCAGGTGCT GGAGTGGGCC
CCGCCCGAGG TGGGCCGGCT GGCCGCCACC CTGGCCGACC TGGAAGGCCG GGTGGATGGG
CTGCCCATGC CGGCGGAGCG GGCCACCGGC GGCATCGATC TGCCGCGCAT GGCCGCGGAC
GAGGGCGACC CGGTCTGGCT GGCGCGGCTG AAGGACGCCA CCGGCCGGGT CCAGGCTGGA
TTGGGTGAGC TGGTGGTGGT GCAGCGCGAG GAGGCCGCGC CGCCCCTGGT GGCACCGGAT
CAGCGCTACT TCCTGCGCGA GAACCTCAAG CTGCGCCTGG AGGCGGCCCG ACTGGCCGCA
CTGCAGGGCG ATCAGGACCT GTGGGAGGAC AGCCTGCAGC GGGCCCACGA CTGGGTCCTG
GCCCACTTCG ATACCAGCGA TCTGGATGTG GAGGCGGTGG CGGACACGCT GGCGCGCCTG
CGCCGTCAGG ACATCGACCC GGAGCTGCCG GATATCGCCA CCACCCTGGA ACCGGTCAAG
CCGTTCCTGT AA
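
The GC content field can be recomputed from a sequence printed in this block layout. The following is a minimal sketch, not the database's own method: `gc_percent` is a hypothetical helper that strips the spaces and line breaks between the 10-base groups and reports the share of G and C bases. It is run here on the first row only, so its result describes that row and not the 72% reported for the whole gene.

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace is removed first, so block-formatted text can be pasted as is.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First row of the gene sequence above (60 bases).
first_row = "ATGCAGGAAA ATAAACCCGA GCGGGAAGAG GAGAAGCCCC GGTCTGAGAA GACCGGCGGC"
print(gc_percent(first_row))
```

Passing the full 1212-base sequence to the same function gives a value to compare with the GC content field.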
 
Protein sequence
MQENKPEREE EKPRSEKTGG EDPQGQELTA SAAAPEPGKG GGGTPPAGGD GDGTDKDRQG 
GPWKQVVALL VVVLVLGAAA TWWLTGEMRE LRAEQARMVS ADRLDERSDA LERQLARLED
RVTDTGERAA SARERADEAG DALGTLREQL DELRARQGGF EEGLERLGAR AEANRENWIR
SEAAYLATVA VHRMRFHRDP KTALGALQAA DKLMADIGAS ESVPARVALN EAVTQVLEWA
PPEVGRLAAT LADLEGRVDG LPMPAERATG GIDLPRMAAD EGDPVWLARL KDATGRVQAG
LGELVVVQRE EAAPPLVAPD QRYFLRENLK LRLEAARLAA LQGDQDLWED SLQRAHDWVL
AHFDTSDLDV EAVADTLARL RRQDIDPELP DIATTLEPVK PFL
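
The protein sequence can be checked against the gene sequence by translating codon by codon. The following is a minimal sketch: translation table 11 assigns codons to amino acids in the same way as the standard genetic code, so the standard table is used here. `translate` is a hypothetical helper, and it does not model the alternative start codons that table 11 permits.

```python
from itertools import product

# Standard codon table in NCBI order: bases T, C, A, G, first base slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a nucleotide sequence in frame 1, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    residues = []
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        residues.append(amino_acid)
    return "".join(residues)


# First row of the gene sequence above (60 bases, 20 codons).
first_row = "ATGCAGGAAA ATAAACCCGA GCGGGAAGAG GAGAAGCCCC GGTCTGAGAA GACCGGCGGC"
print(translate(first_row))
```

The 20 residues produced from the first row can be compared with the first two 10-residue groups of the protein sequence above.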