Gene Mlg_0380 details

Gene Information

Locus tag: Mlg_0380
Symbol:
ID: 4269005
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 424206
End bp: 425171
Gene Length: 966 bp
Protein Length: 321 aa
Translation table: 11
GC content: 72%
IMG OID: 638125111
Product: thiamine-monophosphate kinase
Protein accession: YP_741225
Protein GI: 114319542
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0611] Thiamine monophosphate kinase
TIGRFAM ID: [TIGR01379] thiamine-monophosphate kinase
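
The start, end, and length fields in this record are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the protein length excludes a single terminal stop codon; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Coordinates copied from the record above.
# Assumption: 1-based, inclusive positions on the replicon.
start_bp = 424206
end_bp = 425171

# With inclusive coordinates, length = end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not counted
# in the protein length.
protein_length = gene_length // 3 - 1

print(gene_length, "bp;", protein_length, "aa")
```

Under these assumptions the values agree with the Gene Length and Protein Length fields.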


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.450658
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value: 0.175688
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACGAAT TTCAACTCAT CCGCAGCTAC TTCCAGCCCG ACGCCAGGGG CGAGGGCGTG 
GTGTTGGGGG TGGGTGACGA CGCCGCCCTG CTGCAGCCCG CCCCGGGCCA ACTGCTGGTC
ACCTGCGTGG ACACCCTGGT GGCCGGCGTG CACTTCCCCG AAGACGCCCC GCCGGACGCC
GTCGGCCACA AGGCCCTGGC GGTGAACCTC AGCGACCTGG CCGCCATGGG CGCCCGCCCC
CGCTGGTTCC AGTTGGCGCT CACCCTCCCG GAGATCGACG AGGCCTGGCT GGCGGCCTTC
TCCAGCGGCC TGCACCGCCT GGCCGCCGAG CAGGACGTGG CCCTGGTCGG CGGCGACACC
ACCCGCGGGC CCCTCACCAT CACGGTCCAG GCCATGGGCG AGGTGGCGCC CGCGCAGGCA
TTGCGCCGGA CCGGCGCCCG CGCCGGCGAC CGGCTCTACG TGACCGGCAC CCTGGGTGAT
GCCGCCCAGG GGCTGGCCCT GTGGCAGCGG GGCGTGCGGT CTGCCGGCGG AGATGACCCG
GCCGGCTTTC TGATCGACCG GCTGCACCGC CCCACCCCCC GGATGGCCGC AGGCCGCGCG
GCGGCGGGCC TGGCTCGGGC CGCCATCGAT ATCTCCGACG GCCTGCTCGC GGACCTGGGG
CACCTGTTGG AAGGTGGCGA AGGGCTGGGG GCCGTGCTGC AGGCCGACAG CCTGCCGCTC
TCGCCCGCGT ACCGCGCGCA CTGCGAAGAC TCTTTACCGG GGCGGGCGGC CCTGTCCGGG
GGCGACGACT ACGAGTTGCT GTTTGCGGTG GCCCCCGAGA ACGAGGCGGC ATTCCAAACG
GCCCTGCAAC ATGTGCCGGC TGGATGCACC TGCATTGGCT GGATCACCGA GGATTCGGCG
ATTACCCTGC AAGGGGACGG AAAGGCGCAG GTCCTGACCC GCCAGGGGTA TCAGCACTTC
AACTAG
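
The GC content field can in principle be recomputed from the gene sequence. This is a minimal Python sketch, assuming the sequence is pasted in the space-grouped blocks shown above. The helper name `gc_fraction` is hypothetical and not part of the database, and the record does not say how its reported percentage is rounded.

```python
def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First two 10-base blocks of the gene sequence above, used only as a demo.
sample = "ATGGACGAAT TTCAACTCAT"
sample_gc = gc_fraction(sample)
print(f"{sample_gc:.2f}")
```

Passing the full gene sequence instead of `sample` gives the whole-gene value to compare with the GC content field.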
 
Protein sequence
MDEFQLIRSY FQPDARGEGV VLGVGDDAAL LQPAPGQLLV TCVDTLVAGV HFPEDAPPDA 
VGHKALAVNL SDLAAMGARP RWFQLALTLP EIDEAWLAAF SSGLHRLAAE QDVALVGGDT
TRGPLTITVQ AMGEVAPAQA LRRTGARAGD RLYVTGTLGD AAQGLALWQR GVRSAGGDDP
AGFLIDRLHR PTPRMAAGRA AAGLARAAID ISDGLLADLG HLLEGGEGLG AVLQADSLPL
SPAYRAHCED SLPGRAALSG GDDYELLFAV APENEAAFQT ALQHVPAGCT CIGWITEDSA
ITLQGDGKAQ VLTRQGYQHF N
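
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table from the compact NCBI layout (bases in T, C, A, G order); translation table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons. The function name `translate` is hypothetical, and the two fragments are copied from the start and end of the gene sequence above.

```python
from itertools import product

# Compact NCBI layout: codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(seq: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# Fragments copied from the gene sequence above (start and end of the CDS).
head = translate("ATGGACGAAT TTCAACTCAT C")
tail = translate("CAGCACTTC AACTAG")
print(head, tail)
```

The two fragments translate to the first seven and last four residues of the protein sequence listed above.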