Gene Athe_0502 details

Gene Information

Locus tag: Athe_0502
Symbol:
ID: 7408626
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 571477
End bp: 573084
Gene Length: 1608 bp
Protein Length: 535 aa
Translation table: 11
GC content: 41%
IMG OID: 643714884
Product: acetolactate synthase, large subunit, biosynthetic type
Protein accession: YP_002572401
Protein GI: 222528519
COG category: [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism
COG ID: [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase]
TIGRFAM ID: [TIGR00118] acetolactate synthase, large subunit, biosynthetic type
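
The length fields above follow from the coordinates. A minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon (both are assumptions that happen to match the listed values; the function names are illustrative only):

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length_bp = gene_length(571477, 573084)
print(length_bp)                  # 1608
print(protein_length(length_bp))  # 535
```

Both results agree with the Gene Length and Protein Length fields of this record.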


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0235829
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAAAGA TGACGGTAGC ACGGGCAATG GTAGAGGTTT TAAAAAGCGA AGGTGTAGAG 
ATTATCTTTG GCATTCCAGG TGCGGCAATA TATCCGTTTT ATGATGCTCT TTATGATTCT
AACATAAAGC ATGTCCTTGT GAGAACAGAA CAGGCAGCAG TTCATGAGGC AAGTGGATAT
GCGCGCACAA CTGGCAAGGT AGGTGTGTGC GTTGCAACCT CTGGGCCTGG TGCTACAAAT
CTTATAACTG GCATTGCAAC TGCATATATG GATTCAGTTC CTATTGTGGC TATCACAGGT
CAGGTAAATT CAAGTTTAAT TGGAAAAGAT GTGTTTCAAG AAGTGGATAT AACAGGGGCA
ACAGCTCCGT TTACCAAGCA TAACTATCTT GTAAAAGACC CTAAAAAAAT TGTAAGGATT
TTAAAAGAAG CATTCTATAT AGCTTCAACA GGAAGACGAG GACCTGTTTT GATAGATGTT
CCTATAGATG TTCAGATGCA GGAGATTGAA TTTGAAATCC CAAAAGAAAT TGATATTCCT
GGCTACAAGC CAAAAGAAAA AGGGCATCCT CTGCAGATAA AAAGGGCAGT AGAGGCAATA
GAAAGCTCAA AAAGACCTGT TGTATGCAGC GGTGGTGGAG TTATTGCATC AGGGGCATCA
GAGGAGCTTC GAATTTTAGT AGAAAAGCAA AAGATCCCTG TAATTTCAAC CCTAATGGGA
ATTGGCTGTA TTCCTACAGA CCATCCTTAC TATCTTGGCA TGATAGGCTC ACATGGGCAA
AGAGAGGCAA ATTTGGCGCT CAGACAAGCA GACCTTTTAA TTGTGATAGG TGCGCGGCTT
GCCGACAGGG CGCTGGGTGA TACAAAGATT ACTGACAATA TGAAGATTAT TCATATAGAC
ATTGACCCTG CTGAGATAGG CAAAAATGTT GACACAAACA TTCCAATTGT TGGCGATGCA
AAACAGGTGC TTTCAGAGAT TAATAAAAGA ATTTCAGAAA GAAAAGATTT TTGGGCTCAT
GAGATTAAAG CGCAAAGGAA AGTTCTTCCA GATGATGACA AGCTTCATCC TTATGATGTG
CTAAGAGAAA TTTCAAGGGC ATACAATGGT GATTATATAA TCACAACAGA TGTTGGTCAG
CATCAGATTT GGGCGGCTCA TAATCTATAT ATCAAAGAAC CAGGAACCTT TATAACTTCG
GGTGGACTTG GTACAATGGG ATATGGCGTC CCTGCTGCAA TTGGCGCAAA GTTTGGAAGA
CCCGAGAAGG AGGTAATTAG CATCACTGGC GATGGAAGTT TTCAGATGCT TTTGCAGGAA
CTTGCAACAA TAAAAAGAGA GCAGGTACCG GTAAAAATTG TGCTTTTTAA CAATACAAGG
CTTGGAATGG TATATGAGCT TCAGAAAAAA AGATGTACAG GCAGATTTAT TGCAACATGC
TTGGATGGTA ACCCTAACTT TATGATATTA GCAAAAGCAT ATGGCATTGA GAGTATGAGG
CTTGAGAGCA AGGAAAAGTT AAAAGAGGCT ATTGAGATTA TGAAAAGCCA CAATGGTCCA
TTTTTGCTTG AAGTTGTAAC AAGCCCTGAT GAGCCAACTA TACCTTAA
 
Protein sequence
MAKMTVARAM VEVLKSEGVE IIFGIPGAAI YPFYDALYDS NIKHVLVRTE QAAVHEASGY 
ARTTGKVGVC VATSGPGATN LITGIATAYM DSVPIVAITG QVNSSLIGKD VFQEVDITGA
TAPFTKHNYL VKDPKKIVRI LKEAFYIAST GRRGPVLIDV PIDVQMQEIE FEIPKEIDIP
GYKPKEKGHP LQIKRAVEAI ESSKRPVVCS GGGVIASGAS EELRILVEKQ KIPVISTLMG
IGCIPTDHPY YLGMIGSHGQ REANLALRQA DLLIVIGARL ADRALGDTKI TDNMKIIHID
IDPAEIGKNV DTNIPIVGDA KQVLSEINKR ISERKDFWAH EIKAQRKVLP DDDKLHPYDV
LREISRAYNG DYIITTDVGQ HQIWAAHNLY IKEPGTFITS GGLGTMGYGV PAAIGAKFGR
PEKEVISITG DGSFQMLLQE LATIKREQVP VKIVLFNNTR LGMVYELQKK RCTGRFIATC
LDGNPNFMIL AKAYGIESMR LESKEKLKEA IEIMKSHNGP FLLEVVTSPD EPTIP
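
The sequences above are displayed in blocks of 10 characters, so they need to be cleaned before any computation. The following is a minimal sketch of one way to do that and to run two simple checks: the GC fraction, and whether a coding sequence ends in frame on a stop codon of translation table 11 (TAA, TAG, TGA). The helper names are hypothetical and not part of the source database.

```python
STOP_CODONS_TABLE_11 = {"TAA", "TAG", "TGA"}


def clean_sequence(text: str) -> str:
    """Remove the display spaces and line breaks, and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def ends_with_stop(text: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop codon."""
    seq = clean_sequence(text)
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS_TABLE_11


first_blocks = "ATGGCAAAGA TGACGGTAGC"  # first two blocks of the gene sequence
last_blocks = "GAGCCAACTA TACCTTAA"     # last two blocks of the gene sequence

print(len(clean_sequence(first_blocks)))  # 20
print(gc_fraction(first_blocks))          # 0.5
print(ends_with_stop(last_blocks))        # True
```

Applied to the full gene sequence, `gc_fraction` gives a value that can be compared against the GC content field, and `len(clean_sequence(...))` against the Gene Length field.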