Gene Mlg_2084 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_2084 
Symbol 
ID4269403 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp2361885 
End bp2362844 
Gene Length960 bp 
Protein Length319 aa 
Translation table11 
GC content74% 
IMG OID638126840 
Producthypothetical protein 
Protein accessionYP_742916 
Protein GI114321233 
COG category[F] Nucleotide transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0352] Thiamine monophosphate synthase
[COG1051] ADP-ribose pyrophosphatase 
TIGRFAM ID[TIGR00586] mutator mutT protein 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value0.498942 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCGTC TGCACGTGGC CGTGGGTGTC ATCCTCGACG ACCGGCAGCG GGTACTGGTG 
GCGCGCCGGG CCGCCCACCG CCACCAGGGC GGGCGGTGGG AGTTTCCCGG CGGCAAGGTG
GAGCCGGGCG AGACGGTGGT GCAGGCGCTC TGCCGCGAAC TCGAGGAGGA GTTGGCGATC
AGCCCCACCC GCACCTCGCC GATGATGCGC ATCGAACACG ACTACCCGGA CCGCCGCGTC
AGCCTGGATG TGCACCGGGT GAGCGCCTGG CGGGGCGAGC CACGCGGGCT CGAGGGCCAG
CCGCTGGCCT GGCTGAGGGC CACGGAGTTG GCCCGCCGGC CTTTTCCGCA GGCCAATCTC
CCCATCATCC GACGGCTGGC CCTGCCGCCC TTTCTGATCA TCACCGAGCC GCTGGCCCCC
GGTGACCTGG CGGGCCTGGC GCGCCGGCTC CAGTCGCTGG CCGTGCCGGC TCGCGGGGCC
TGGCTGCAGC TGCGTCTGCC GGACTGGGAT GATCGGGCCT ATGGCCGGGC GCTGGCGTTG
GCCATCAGGA CCCTGGGGCC CCGGGGGGTG GACGTGACCG CGAACCGCTC ACCCGCGGTG
GCACGCCGCG CCGGTGGTCA CGCCCTGCAC CTGAACGCCC GCGCGCTGAT GGCCTGCGAG
GCGCGTCCCG AGGGCTTTGT GCGGGTGGGG GCCTCCTGCC ACAGCCCTGA GGAACTGGCC
CGGGCCGAGG CCCTGGGGCT GGACTATGCG CTGCTCTCTC CGGTCGCCGC CACGGCCTCG
CACCCCCGGC AGGTGCCGTT GGGCTGGGAG CGATTCCGGG ACTGGCTGGG CCGGGTGGAC
CTGCCGGTCT ACGCCTTGGG TGGCTTGGGG CCGGAGGCGC TGGAGTTGGC CTGGGCCCAT
GGGGCGCACG GGGTGGCGGG GATCCGCGGC TTCTGGCCGC CGCGCGGATC GCCGCCATAG
 
Protein sequence
MARLHVAVGV ILDDRQRVLV ARRAAHRHQG GRWEFPGGKV EPGETVVQAL CRELEEELAI 
SPTRTSPMMR IEHDYPDRRV SLDVHRVSAW RGEPRGLEGQ PLAWLRATEL ARRPFPQANL
PIIRRLALPP FLIITEPLAP GDLAGLARRL QSLAVPARGA WLQLRLPDWD DRAYGRALAL
AIRTLGPRGV DVTANRSPAV ARRAGGHALH LNARALMACE ARPEGFVRVG ASCHSPEELA
RAEALGLDYA LLSPVAATAS HPRQVPLGWE RFRDWLGRVD LPVYALGGLG PEALELAWAH
GAHGVAGIRG FWPPRGSPP