Gene Athe_0704 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_0704 
Symbol 
ID7407128 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp789690 
End bp792161 
Gene Length2472 bp 
Protein Length823 aa 
Translation table11 
GC content42% 
IMG OID643715076 
ProductTransketolase central region 
Protein accessionYP_002572592 
Protein GI222528710 
COG category[C] Energy production and conversion 
COG ID[COG0022] Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, beta subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCAAAGT CACAGTTTAT TGACCCGAAC GAGGTGAGAA AAAGTGGCTG GATAAAATTT 
TTTGATATTC CTGTAAACCA GTATAACAAA ACCTTAGAAG AGGAGAGACA AAACTTTTCG
GATGACCAGC TGATTAGAAT TTACAGAGAC ATGCTTATAA TCCGCGAATT TGAGACAATG
CTCTCTTTAA TAAAAACAAC TGGGGAGTAC AATGGAATAA AGTATGACTA CCCGGGACCG
GCACACCTGT CGATTGGTCA GGAAGCAGTA GCGGTGGGCC AGGCTTTTGT GCTTGATAAA
GATGACTTTA TATTTGGTTC ACATAGAAGT CATGGAGAGG TTATTGCTAA GGGCCTTTCA
ACAATTGAAA AGCTCAGTGA CAATGAGCTT TTAAAAATTA TGGAAAGCTA TTTTGATGGT
TCAATACTTA GAGTTGTGGA AGAAAACTTA AAAAATATCT CAAGTATTAA AGAACTTGCA
GTCAATTTTT TCTTGTATGG CACGCTTGCC GAGATATTTG GAAGAGAGAC TGGGTTTCAA
AAAGGTCTTG GCGGGTCTAT GCATGTGTTC TTCCCACCAT TTGGAATTTA CCCGAACAAT
GCAATTGTTG GAGGGTCTGC TGACATTGCA GTAGGAGCAG CTTTGTTTAA GAAAATCAAT
AAGAAAAATG GCATTGTTGT TGTCAATATT GGCGATGGTT CGATGGCGTG TGGACCTGTA
TGGGAGGCTA TGTGCCTTGC TTCAATGGAC CAATACAAAA AATTGTGGGA TGATGAATAT
AGAGGTGGTC TTCCAATAAT CTTCAATTTT ATGGACAATC AATATGCTAT GGGCGGGCAG
ACACGCGGCG AGACAATGGG ATATGACATG CTTGCAAGAG TCGGAGCAGG CGTTAACCCT
GAGCAGATGC ATGCTGAGCG TGTTGATGGC TACAATCCAC TGGCTGTAAT TGATGCAATG
AAGAGAAAGA AATACCTTCT TGAACAAAAA CAGGGTCCGG TTCTTTTGGA TATTGTCACA
TACAGGCTCA CAGGACACTC ACCATCTGAC TCATCTTCTT ACAGGACAAA AGAGGAGGTT
GAGGCATGGG CAGCTCAAGA CCCAATAGTA ACTTATAAGG ATGAGTTAAT CAAAGCAGGT
GTTGTGACAG AAGAAAAGAT AGAGGAGATT CAAAGCTATG TGAAAGAGCT TATAACAAAG
ATATGTGCTC TTGCTGTTGA TGAAAATGTT TCGCCAAGAA TAAATCTTGT GAAAGACCCT
GATGGTATAG CAAGATATAT GTTCTCAAAC CAGAAGATTG AGAAGATGGA AGACAGAACT
CCTGAGGTTT TGATTCCAAA AGAAGAAAAT CCGCGCGTAA AACAGATAAA AAACAAAATA
AGAGTAGGAA TTGTTGACGG AAAACCTGTT CCAAAGGCAA AGGTGTTCAA TCTCAGAGAC
GCAATATTTG AAGCGCTGCT TGATAAGTTC TACACAGATC CAACACTTAT CTCATACGGG
GAAGACTTGC GCGACTGGGG CGGAGCTTTT GCGGTCTACA GAGGACTTAC AGAGTCTTTG
CCATATCACA GACTATTTAA CACCTGTATC TCAGAAGGTG CAATAGTTGG GTCTGCAGTT
GGATATGGGA TGTGTGGTGG AAGGGTTGTT GTGGAGATAA TGTACTGCGA TTTTATCGGA
AGAGCAGGGG ATGAGATATT CAATCAGCTT GCAAAATGGC AGGCAATGAG CGCAGGGACA
TTGAAAATGC CTGTTGTTGT GAGGGTTTCT GTTGGTTCAA AATATGGTGC ACAGCACTCA
CAGGACTGGT CTTCTATTGT CTCTCACATT CCTGGACTTA AAGTTGTATT CCCAGCAACA
CCTTACGATG CAAAAGGTCT TATGAACAGC GCACTGTCTT CCACAGACCC AGTGATATTT
TTTGAAAGCC AAAGACTGTA TGACATTGGA GAGCTTTTCC ACAAAGAAGG TGTCCCGGAA
GGATATTATG AGGTTCCAAT CGGCGAACCT GATATCAAAA AAGAAGGTAA GGACATTACA
ATCCTGACAG TTGGAGCAAC ACTGTACAGA GCACTTGATG CAGCCAAAAT CTTGGAAGAA
AAGTATGGTG TTAGTGCTGA AATCATTGAT GCGCGGTCGC TCGTACCTTT TAACTATGAG
AAGGTGATTG AATCTGTCAA AAAAACAGGA AAAATTGTAC TGGCTTCTGA CGCATGTGCA
AGGGGCTCAA TTTTGAAAGA CATGGCAGCA ACAATTGCCG ACCTCGCATT TGACTATCTT
GACGCGCCAC CTGTTGTAGT TGGTTCTAAA AACTGGATTG TCCCTGCATA CGAATTTGAA
AACTATTTCT TTCCGCAAGC TGACTGGATT ATTGACGCAA TCCATGAAAG GATTATGCCG
CTCAAAGGTC ATGTGCCAAA GAACAACTTC ACAACAAATG AGATTTTAAG GACAAATAGA
CTTGGTATAT AA
 
Protein sequence
MPKSQFIDPN EVRKSGWIKF FDIPVNQYNK TLEEERQNFS DDQLIRIYRD MLIIREFETM 
LSLIKTTGEY NGIKYDYPGP AHLSIGQEAV AVGQAFVLDK DDFIFGSHRS HGEVIAKGLS
TIEKLSDNEL LKIMESYFDG SILRVVEENL KNISSIKELA VNFFLYGTLA EIFGRETGFQ
KGLGGSMHVF FPPFGIYPNN AIVGGSADIA VGAALFKKIN KKNGIVVVNI GDGSMACGPV
WEAMCLASMD QYKKLWDDEY RGGLPIIFNF MDNQYAMGGQ TRGETMGYDM LARVGAGVNP
EQMHAERVDG YNPLAVIDAM KRKKYLLEQK QGPVLLDIVT YRLTGHSPSD SSSYRTKEEV
EAWAAQDPIV TYKDELIKAG VVTEEKIEEI QSYVKELITK ICALAVDENV SPRINLVKDP
DGIARYMFSN QKIEKMEDRT PEVLIPKEEN PRVKQIKNKI RVGIVDGKPV PKAKVFNLRD
AIFEALLDKF YTDPTLISYG EDLRDWGGAF AVYRGLTESL PYHRLFNTCI SEGAIVGSAV
GYGMCGGRVV VEIMYCDFIG RAGDEIFNQL AKWQAMSAGT LKMPVVVRVS VGSKYGAQHS
QDWSSIVSHI PGLKVVFPAT PYDAKGLMNS ALSSTDPVIF FESQRLYDIG ELFHKEGVPE
GYYEVPIGEP DIKKEGKDIT ILTVGATLYR ALDAAKILEE KYGVSAEIID ARSLVPFNYE
KVIESVKKTG KIVLASDACA RGSILKDMAA TIADLAFDYL DAPPVVVGSK NWIVPAYEFE
NYFFPQADWI IDAIHERIMP LKGHVPKNNF TTNEILRTNR LGI