Gene Athe_0666 details

Gene Information

Locus tag: Athe_0666
Symbol:
ID: 7407090
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 751628
End bp: 753286
Gene Length: 1659 bp
Protein Length: 552 aa
Translation table: 11
GC content: 42%
IMG OID: 643715047
Product: acetolactate synthase, large subunit, biosynthetic type
Protein accession: YP_002572563
Protein GI: 222528681
COG category: [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism
COG ID: [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase]
TIGRFAM ID: [TIGR00118] acetolactate synthase, large subunit, biosynthetic type
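
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of Python. This is a minimal sketch, not part of the original record: the function names are hypothetical, and it assumes that the start and end positions are 1-based and inclusive and that the gene length includes the stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(751628, 753286)
print(length, expected_protein_length(length))
```

With the values listed in this record, the sketch gives 1659 bp and 552 aa, matching the Gene Length and Protein Length fields.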


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGAAATTGA CAGGAGCTGA GATTATAATT GAATGTTTAA AAGAACAGGG CGTAAATGTT 
GTTTTTGGTT ATCCGGGTGG TGCTGCTTTA AATATTTATG ATGCTCTTTA CAAACATCAA
AATGAGATAA AACATTATCT AACATCCCAT GAACAGCACG CATCCCACGC TGCAGACGGA
TATTCAAGAG CGTCCGGCAA GGTTGGCGTG GTTTTTACTA CCTCAGGACC TGGTGCAACA
AACATTGTGA CAGGTATTGC AACAGCGTAC ATGGACTCTG TGCCGGTTGT GGCAATCACA
GGGCAAGTAC CAACTAATTT GCTTGGTAAA GACTCGTTCC AAGAGGTTGA TATTACAGGT
ATTACCATGC CAATCACAAA GCACAATTTT ATTGTTAAAG ATGTAAATAC ACTTGCAGAC
ACAATCAGAC GCGCATTTGA GATTGCGCAG AGTGGAAGAC CTGGACCTGT TTTGGTTGAC
GTTTGCAAAG ATGTGACAGC AGCATATGCT GAATATGAGA AGAAAGAACC TAAAAAGATA
AAAAAGAAGG TATTAGCAAC AAAAGAAGAG ATAGAAAAGG CAATAGAACT TATCAATGCA
AGCGAGAGAC CTTTTATCTG CTCAGGTGGC GGTGTTATTT CATCTGAAGC CTCTGAAGAA
CTTATTGAAT TTGTGGAAAA AATAAACGCA CCTGTTGCAA CAACGCTCAT GGGTGTTGGT
GGATTTCCTT CTACCCATCC GAACTATACA GGGCTTGTTG GAATGCATGG CACAAGGGCT
TCTAATTATG CAGTTTCGCA CTGTGACCTT TTGATTGCTG TTGGTGCAAG ATTTTCAGAT
AGGGTAATCA GCAAGGTTGA CAGGTTTGCA CCAAATGCAA AGATTATTCA CATTGATATT
GATCCGGCTG AGATTGACAA AAATATAAGC ACAGACATTG CGCTTGTTGG CGATGTAAAG
CAGATATTAA AAATCTTGGT TGAGAATGTA CAGAAGAAGA CTAACACAGA CTGGATAGAG
ATGATTTACG AATGGAAGAA AAACTATCCT TTGAGCTATC CTCAAGATGG CAAGCTTCAT
CCCCAGTATG TTGTTGAGAG AATTTCTGCT CTTACCAACA ACGATGCAAT AATCACAACA
GAGGTTGGAC AAAACCAGAT TTGGGCAGCA CAATATTACA AATACCAAAG ACCAAGACAA
TTTATTTCCT CTGGCGGGCT TGGTACAATG GGTTATGGCT TTGGTGCGGC AATTGGAGCA
AAGATAGCAA AGCCGGACAA AGTAGTCATT GACATTGCAG GTGATGGCAG CTTTAGGATG
AACTGTGGCG AGCTTGCAAC AGCTGTGCAC TACAATATTC CTGTGATAGT TGCGCTGCTT
AACAATAGTG TTCTGGGAAT GGTTCGCCAG TGGCAGGACC TTTTCTATGG CAAGAGATTT
TCACAAACAA CTCTTGACAG GCCGCCTGAT TTTGTCAAGC TTGCAGATGC ATATGGTGCA
GTTGGCATAA GAGTTACATC GCCAGATGAG GTTGACAGGG CTATTTTAAA AGCGTTAGAG
GCAGGAAGAC CAACAGTAAT TGACTTTGTA ATTGACAAAG ACGAAAAAGC GCTGCCAATT
GTCCCACCCG GCGCGCCAAT TGATGAGATT ATAGACTAA
 
Protein sequence
MKLTGAEIII ECLKEQGVNV VFGYPGGAAL NIYDALYKHQ NEIKHYLTSH EQHASHAADG 
YSRASGKVGV VFTTSGPGAT NIVTGIATAY MDSVPVVAIT GQVPTNLLGK DSFQEVDITG
ITMPITKHNF IVKDVNTLAD TIRRAFEIAQ SGRPGPVLVD VCKDVTAAYA EYEKKEPKKI
KKKVLATKEE IEKAIELINA SERPFICSGG GVISSEASEE LIEFVEKINA PVATTLMGVG
GFPSTHPNYT GLVGMHGTRA SNYAVSHCDL LIAVGARFSD RVISKVDRFA PNAKIIHIDI
DPAEIDKNIS TDIALVGDVK QILKILVENV QKKTNTDWIE MIYEWKKNYP LSYPQDGKLH
PQYVVERISA LTNNDAIITT EVGQNQIWAA QYYKYQRPRQ FISSGGLGTM GYGFGAAIGA
KIAKPDKVVI DIAGDGSFRM NCGELATAVH YNIPVIVALL NNSVLGMVRQ WQDLFYGKRF
SQTTLDRPPD FVKLADAYGA VGIRVTSPDE VDRAILKALE AGRPTVIDFV IDKDEKALPI
VPPGAPIDEI ID
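
The relationship between the two sequences can be sketched in Python. The example below is an illustration only, with hypothetical function names. It removes the 10-character grouping, then translates the coding sequence under translation table 11. It relies on two facts about that table: its amino-acid assignments match the standard genetic code, and it accepts alternative initiation codons such as TTG, the first codon of this gene. The sketch assumes the input is a complete, in-frame CDS given 5' to 3' on the coding strand, and it writes the initiation codon as M.

```python
from itertools import product

# Standard codon order (T, C, A, G); table 11 uses the same amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean_sequence(raw: str) -> str:
    """Strip the spaces and line breaks used to group the sequence for display."""
    return "".join(raw.split()).upper()


def translate_cds(raw_dna: str) -> str:
    """Translate an in-frame CDS; the initiation codon is always written as M."""
    dna = clean_sequence(raw_dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError(f"{codons[0]} is not a table 11 initiation codon")
    protein = ["M"]
    for codon in codons[1:]:
        residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence, copied with their original spacing.
print(translate_cds("TTGAAATTGA CAGGAGCTGA GATTATAATT"))
```

Applied to the first 30 bases of the gene sequence, the sketch prints MKLTGAEIII, the first ten residues of the protein sequence above.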