Gene Mlg_2137 details

Gene Information

Locus tag: Mlg_2137
Symbol:
ID: 4269877
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 2433464
End bp: 2434519
Gene Length: 1056 bp
Protein Length: 351 aa
Translation table: 11
GC content: 68%
IMG OID: 638126893
Product: alcohol dehydrogenase
Protein accession: YP_742969
Protein GI: 114321286
COG category: [E] Amino acid transport and metabolism; [R] General function prediction only
COG ID: [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases
TIGRFAM ID:
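
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon. The function names are hypothetical and not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the record above.
gene_len = span_length(2433464, 2434519)
print(gene_len)                  # 1056
print(protein_length(gene_len))  # 351
```

Both results agree with the Gene Length and Protein Length fields listed for Mlg_2137.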


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value: 0.0423941
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCACGA TGCAGGCAGC GGTCTTCGTC GAGCCTGGCC GCATCGAGGT CCAGGAAAAA 
CCCATACCGG AGATCGGGCC CACCGACGCC CTGCTCCGGG TCACTACCAC CACCATCTGC
GGCACCGATG TCCACATCCT CAAGGGCGAG TATCCGGTGG AACCGGGTCG GATCGTCGGT
CATGAGCCGG TGGGGGTGAT CGAGAAGCTG GGCGAGGCGG TGACCGGTTA TGAGCCCGGT
CAGCGGGTGA TCGCCGGGGC CATCACCCCC TGCGGCCAAT GCCACGCCTG CCAGGACGGG
GTCTCCAGCC AGTGCGGCGG CAAGGCCATG GGCGGCTGGC AGTTGGGCAA CACCATCGAC
GGCTGCCAGG CGGAGTACGT CCGCATCCCC AACGCCCAGG CCAACCTGAC CCCGGTGCCG
GACGAGCTGA CCGACGAGCA GGTGCTGATG TGCCCGGACA TCATGAGCAC CGGGTTCGGC
GGTGCGGAAA ACGGCCATAT CCGGATCGGC GACACCGTGG CCATCTTCGC CCAGGGGCCG
ATCGGGCTGT GCGCCACCGC CGGCGCCAAG CTGATGGGGG CCACCCGGAT CATCGTGGTG
GACGGGGTCC CCGAGCGGCT GGAGACCGCC CGCAAGCTGG GCGCCGACGT TGGCGTCAAC
TTCCGCGAGC AGGATCCGGT CGAGGCCATC ATGGAACTGA CCGGGGGACG CGGCGTGGAT
GTCGCAATCG AGGCCCTGGG GCTTCAGGAG ACCTTCGAGG CCTGCCTGCG GGTGCTCAAA
CCCGGCGGCA CCCTGTCCAG CCTGGGGGTC TACTCCGGAA AACTCTCCAT GCCGCTGGAC
GCCATTGCCG CTGGCTTGGG CGATCACACC ATCGTGACCA CCCTCTGCCC CGGCGGCAAG
GAGCGCATGC GCCGGCTGAT GGAGGTGGTG GCCGCCGGCC GGGTGGACCT GACCGCCATG
GTGACCCACC GCTACACGCT GGACCAGATC GTCGAGGCCT ACGACCTTTT CTCGCACCAG
CGCGACGGCG TGCTGAAGGT GGCGATCACC CCATGA
 
Protein sequence
MSTMQAAVFV EPGRIEVQEK PIPEIGPTDA LLRVTTTTIC GTDVHILKGE YPVEPGRIVG 
HEPVGVIEKL GEAVTGYEPG QRVIAGAITP CGQCHACQDG VSSQCGGKAM GGWQLGNTID
GCQAEYVRIP NAQANLTPVP DELTDEQVLM CPDIMSTGFG GAENGHIRIG DTVAIFAQGP
IGLCATAGAK LMGATRIIVV DGVPERLETA RKLGADVGVN FREQDPVEAI MELTGGRGVD
VAIEALGLQE TFEACLRVLK PGGTLSSLGV YSGKLSMPLD AIAAGLGDHT IVTTLCPGGK
ERMRRLMEVV AAGRVDLTAM VTHRYTLDQI VEAYDLFSHQ RDGVLKVAIT P
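
The relationship between the gene sequence and the protein sequence can be checked by translating the nucleotides codon by codon. The sketch below is a minimal illustration, not the method used to produce this record. It builds the codon-to-amino-acid mapping that NCBI translation table 11 shares with the standard code (table 11 differs in its permitted start codons, which does not matter here because this gene begins with ATG). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G at each position).
# Translation table 11 uses the same assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First and last lines of the gene sequence, copied from the record above.
first_line = "ATGAGCACGA TGCAGGCAGC GGTCTTCGTC GAGCCTGGCC GCATCGAGGT CCAGGAAAAA"
last_line = "CGCGACGGCG TGCTGAAGGT GGCGATCACC CCATGA"

print(translate(first_line))  # MSTMQAAVFVEPGRIEVQEK
print(translate(last_line))   # RDGVLKVAITP
```

The two outputs match the first 20 and last 11 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein and the GC content field.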