Gene Mlg_2038 details


Gene Information

Locus tag: Mlg_2038
Symbol:
ID: 4268154
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 2307874
End bp: 2309751
Gene Length: 1878 bp
Protein Length: 625 aa
Translation table: 11
GC content: 67%
IMG OID: 638126794
Product: acetolactate synthase, large subunit
Protein accession: YP_742870
Protein GI: 114321187
COG category: [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism
COG ID: [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase]
TIGRFAM ID: [TIGR00118] acetolactate synthase, large subunit, biosynthetic type
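
The start and end coordinates, gene length, and protein length listed above agree with each other if the coordinates are read as 1-based and inclusive, and if the final codon is a stop codon that is not counted in the protein. Both readings are assumptions about this record's conventions, and the variable names below are illustrative only. A minimal check:

```python
# Values copied from the record above.
start_bp = 2307874
end_bp = 2309751

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)
```

Under those assumptions the computed values match the listed 1878 bp and 625 aa.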


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.206948
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 44
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACAGCG CAACCGCGAA GACCGAAGAG TACACCGACA ACCAGGTTCA TCCCATGGCC 
GGCCAGACCA TGTCCGGTTC CGACATGATC GTAGAGGTAC TGGCCCAGGA AGGTGTCGAC
ACGGTATTCG GTTACAGCGG CGGCGCCATC CTCCCCACCT ACGACGCGAT CTTCCGCTTC
AACGAGCGGC ACCGCCCGGC GGGCGCCGGG GACGATCCCA TCAAGCTGAT CGTGCCCGCC
AACGAGCAGG GTGCCGGTTT CATGGCCGCG GGCTACGCGC GGGCCAGTGG CAAGGTGGGG
GTGTTCATGG TGACCTCCGG CCCGGGGGCG ACCAACACCG TCACCCCCAT CCGCGATTGC
ATGGCCGACT CCGTGCCGGT GGTGGCCATC ACCGGGCAGG TGCCCACCCA CGCCATGGGC
ACCGACGCTT TCCAGGAGGC GCCCATCGTC AACATCATGG GGGCTTGCGC CAAACACGTC
TTCCTGGTGA CCAAGCCCGA GCAGTTGGAG GCCACCATCC GCACCGCCTT CGAGGTGGCC
CGTTCCGGCC GGCCGGGGCC GGTGGTGGTG GACGTGCCCA AGGACTCGCA GAACTGGATT
GGCCAGTTCC AGGGCAGCGG GTTGCTGGAC ATGCGCGGCT ACCGCCAGCG TATGGAGTCG
CTGCGCTCGG CCAAGCTCTC CGACCGCAAG TGTCGCCAGT TCTTTGAAAT GCTGGAGCGC
TCCCGCCGGC CCCTGATCTA TGCCGGCGGG GGCGTGATCA ACGGCAATGC GGCCGAGCAG
CTTCGCGAGT TTGCTCGTAC CTTCCGCATC CCGGTGGTGA CCACCCTGAT GGGGCTGGGC
GCTGTGGATA CCACCGACGA ACTTTGTATG GGCATGCTCG GCATGCACGG GTTCGCCCAC
GCCAACTACG CCGTCGAGGA CTGTGACTTC CTGATCGCCG TGGGCTCGCG CTTTGACGAT
CGCGTGGCCG GAAAGGTGGC GGAGTTCGCG CCCCTGGCCG AGCAGATCGC GCACATCGAC
ATCGACGCCG CCGAGATCGG CAAGGTTAAA TCGGTGGACT GGGCCCATGT GGGCGAGGCC
GGGCGGAGCC TGCGCCAGCT GCTGCGTTAT GGCGAGTCCA TGGGCTTCAA GGGCGAGTTC
CAGCCCTGGC TGGACCACTG CACCGCCCTG CGCGAGCGCC ACGGCATGGA CTACGACCGG
GAGAGCGAAC TCATTCAGCC CCACTACGTC ATCGAGGAGA TGAACAAGCT CACCGACGGG
CGGGCGATTG TGGCCACCGG CGTAGGCCAG CACCAGATGT GGGCGGCCCA GTACTGCGAC
TTCCGGGAGC CGCGGCTGTG GCTCACCTCC GGCAGTATGG GCACCATGGG CTTTGGGCTG
CCCGCCGCCA TCGGCGCCCA GTTCGCCCGC CCGGACGCCC TGGTCATCGA CGTGGACGGC
GATGGCAGCC TGCGCATGAA CCTGGGCGAG TTGGAGACCG CCACCACCTA CGGCCTGCCG
GTCAAGGTCC TGCTGTTGAA CAACTGCGGC GATGGCATGG TGCGCCAGTG GCAGCGGCTG
TATTTCGGCG ACCGCTTCTC CGGCTCCGAC AAGTCCCTGC ACCGGAAGGA CTTCATCAAG
TGTGCCGAGT CGGATGGCTT CGAGTTCGCG CGCCGGGTCA GCGACAAGGC CGAAGTGACG
GAGGCACTCA AGGCCTTCCT CAACTTCGAC GGTCCGGCCT TCCTGGAGGT CCTGATTGAC
CCCGAGGCCT CGGTGCTGCC CATGGTGGGG CCGGGTGCCG GCTACAAGGA GATGGTGACC
GGTGACTGGA TTACCCCGCG GGAGCGGCCG TTACAGCCCC GCGATAGGGA CGAGCAGGAG
GCCCCGGACC TGTTCTGA
 
Protein sequence
MDSATAKTEE YTDNQVHPMA GQTMSGSDMI VEVLAQEGVD TVFGYSGGAI LPTYDAIFRF 
NERHRPAGAG DDPIKLIVPA NEQGAGFMAA GYARASGKVG VFMVTSGPGA TNTVTPIRDC
MADSVPVVAI TGQVPTHAMG TDAFQEAPIV NIMGACAKHV FLVTKPEQLE ATIRTAFEVA
RSGRPGPVVV DVPKDSQNWI GQFQGSGLLD MRGYRQRMES LRSAKLSDRK CRQFFEMLER
SRRPLIYAGG GVINGNAAEQ LREFARTFRI PVVTTLMGLG AVDTTDELCM GMLGMHGFAH
ANYAVEDCDF LIAVGSRFDD RVAGKVAEFA PLAEQIAHID IDAAEIGKVK SVDWAHVGEA
GRSLRQLLRY GESMGFKGEF QPWLDHCTAL RERHGMDYDR ESELIQPHYV IEEMNKLTDG
RAIVATGVGQ HQMWAAQYCD FREPRLWLTS GSMGTMGFGL PAAIGAQFAR PDALVIDVDG
DGSLRMNLGE LETATTYGLP VKVLLLNNCG DGMVRQWQRL YFGDRFSGSD KSLHRKDFIK
CAESDGFEFA RRVSDKAEVT EALKAFLNFD GPAFLEVLID PEASVLPMVG PGAGYKEMVT
GDWITPRERP LQPRDRDEQE APDLF
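
The protein sequence can be related back to the gene sequence by codon translation. The sketch below is a hedged illustration, not the database's own pipeline: it assumes the gene sequence is given 5' to 3' in the coding frame, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in their permitted start codons). The function name `translate` and the two fragments, taken from the first and last lines of the gene sequence, are chosen only for illustration.

```python
# Build the codon table in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"  # codons starting with T
    "LLLLPPPPHHQQRRRR"  # codons starting with C
    "IIIMTTTTNNKKSSRR"  # codons starting with A
    "VVVVAAAADDEEGGGG"  # codons starting with G
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":  # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases and last 18 bases of the gene sequence above.
first_fragment = "ATGGACAGCG CAACCGCGAA GACCGAAGAG"
last_fragment = "GCCCCGGACC TGTTCTGA"

print(translate(first_fragment))
print(translate(last_fragment))
```

The two fragments translate to the first ten and last five residues of the protein sequence listed above, with the closing TGA read as the stop codon.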