Gene Athe_2354 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_2354 
Symbol 
ID7407773 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp2497018 
End bp2499333 
Gene Length2316 bp 
Protein Length771 aa 
Translation table11 
GC content38% 
IMG OID643716718 
Productglycoside hydrolase family 3 domain protein 
Protein accessionYP_002574197 
Protein GI222530315 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTCAATTG AAAAAAGGGT AAACCAGCTT TTGCAGCAGA TGACAGTTGA AGAAAAGGTG 
TATCAGCTCA CAAGTGTGCT TGTAAAAGAT ATTTTGGAAA ACAACCAATT TTCTGAGGAA
AAAGCAAAGA AAGTCATTCC TCATGGTATT GGCCAGATTA CAAGGGTTGC AGGTGCGAGC
AATTTCACAC CTCAACAGGC TTTAGAGGCA GCAAACCAAA TCCAAAAGTT TTTGATTGAA
AACACAAGGC TCAAAATTCC TGCGATAATC CATGAAGAAT CTTGTTCTGG TTTTATGGCA
AGCAAAGCAA CAGTATTTCC ACAGAGCATT GGTGTTGCCT GCACTTTTGA CAATGAACTT
GTAAAAGAGA TGGCAAAGGT TATAAGGCTG CAGATGAAAG CTGTAGGTGC GCATCAGGCT
TTGGCACCAC TTATTGATGT TGCAAGGGAT GCACGATGGG GAAGGGTTGA AGAGACATTT
GGTGAAGACC CATATCTTGT TGCAAATATG GCAGTAAGTT ATGTTGAAGG AATTCAGGGC
AAGAACTTTG AAGAAAAGAT TATTGCAACA GGCAAACATT TTGTTGGTTA TGCAATGTCA
GAAGGTGGGA TGAACTGGGC ACCTGTTCAT ATTCCTGAAA GAGAGCTAAG AGAAGTGTAT
CTTTATCCAT TTGAGGTCGC TGTTAAAGTG GCAGGATTAA AATCAATTAT GCCAGCTTAC
CATGAAATTG ACGGAATTCC TTGTCATGCA AACAGAAAGC TTTTGACCGA AATTGCAAGG
AATGAATGGA GATTCGATGG AATATTTGTG TCTGACTACA GTGGTGTTAA AAATATCTTA
GACTATCATA AGTCGGTTAA AACTTATGAA GAGGCAGCGT ATATTTCTCT TTGGGCAGGA
CTTGATATTG AACTTCCAAG AATAGAGTGT TTTACTGAGA AGTTTATTGA GGCATTAAAA
GAAGGCAAGT TTGATATGGC AGTTGTTGAT GCTGCTGTGA AGAGAGTTTT AGAGATGAAG
TTCAGGCTCG GACTTTTTGA CAATCCATTT GTAAAAACAG AAAATATTTT AGAACTTTTT
GACAATGAGG AGCAAAGAAG CCTTGCAAGA AAAGTTGCCC AAGAGTCTAT GGTTCTTTTG
AAAAACGACG GTATATTGCC ACTTAAAGAA AAAGAACTCA AGAAAGTTGC TGTGATAGGA
CCTAATGCCA ACTCAGTTAG AAATCTTCTT GGTGATTATT CTTACCCAGC ACACATATCA
ACAACAGAAA TGTTCTTTAT GAAAGAAGAG GTTGACCTCG GCGATGAAGA TGCATTTGTC
AAAAAGGTTG TAAATATTAA ATCTGTATAT GAAGTTATAA AAGAAAGAAT AGGTAAGCAT
ACAGAGGTAG TCTATGCAAA AGGTTGTGAT GTAAACTCTC AAGATAAGTC CAGCTTTGAA
GAAGCTAAAA AAGCTGCCCA GGGCGCAGAT GTTGTTATAG TTGTAGTTGG TGACAAGGCA
GGGTTAAAAC TTGACTGCAC ATCTGGTGAG TCAAGAGATA GAGCAAGCTT AAAACTTCCA
GGTGTTCAGG AAGAGCTGAT AGAAGAAATT TCAAAAGTAA ATCAAAACAT TGTTGTTATT
CTTGTAAACG GTCGACCTGT TGCGCTCGAA AATTTCTGGC AAAAGTCCAA AGCTATTCTT
GAAGCTTGGT TCCCGGGCGA AGAAGGTGCA GAGGCGATTG CAGATGTTAT CTTTGGAAAG
TACAATCCGG GTGGAAAACT TGCAATTTCA TTCCCAAGAG ATGTTGGGCA AGTACCGGTA
TACTATAGTC ACAAACCATC CGGTGGAAAA TCATGCTGGC ATGGGGACTA TGTTGAAATG
TCTTCAAAGC CATTTTTACC ATTTGGTTAC GGTCTTTCGT ATACAACTTT TGAATACAAA
AATCTTACCA TTGAAAAAGA AAAAATTACA ATGGATGAGA GCATAAAAAT CTCGGTTGAG
ATAGAAAATA CAGGAAACTA TGAAGGAGAT GAGGTAGTTC AGCTGTATAC AAGAAAAGAA
GAGTTTTTAG TAACAAGACC TGTAAAAGAG CTAAAGGCAT ACAAGAGAGT TCACTTAAAA
CCTGGTGAAA AGAAGAAAGT TGTATTTGAA ATCTTCCCAG ACCAGTTTGC ATACTATGAT
TATGATATGA ACAGGGTAAT CTCACCCGGC ACTGTTGAGG TCATGGTAGG GGCATCTTCA
GAAGACATAA AGTTTACAGG GACATTTGAG ATTGTTGGGG AAAAGAAAGA TGCAAAAGAA
ATCAAAAATT ATCTTAGCCA TGCATGGTGT GAATAA
 
Protein sequence
MSIEKRVNQL LQQMTVEEKV YQLTSVLVKD ILENNQFSEE KAKKVIPHGI GQITRVAGAS 
NFTPQQALEA ANQIQKFLIE NTRLKIPAII HEESCSGFMA SKATVFPQSI GVACTFDNEL
VKEMAKVIRL QMKAVGAHQA LAPLIDVARD ARWGRVEETF GEDPYLVANM AVSYVEGIQG
KNFEEKIIAT GKHFVGYAMS EGGMNWAPVH IPERELREVY LYPFEVAVKV AGLKSIMPAY
HEIDGIPCHA NRKLLTEIAR NEWRFDGIFV SDYSGVKNIL DYHKSVKTYE EAAYISLWAG
LDIELPRIEC FTEKFIEALK EGKFDMAVVD AAVKRVLEMK FRLGLFDNPF VKTENILELF
DNEEQRSLAR KVAQESMVLL KNDGILPLKE KELKKVAVIG PNANSVRNLL GDYSYPAHIS
TTEMFFMKEE VDLGDEDAFV KKVVNIKSVY EVIKERIGKH TEVVYAKGCD VNSQDKSSFE
EAKKAAQGAD VVIVVVGDKA GLKLDCTSGE SRDRASLKLP GVQEELIEEI SKVNQNIVVI
LVNGRPVALE NFWQKSKAIL EAWFPGEEGA EAIADVIFGK YNPGGKLAIS FPRDVGQVPV
YYSHKPSGGK SCWHGDYVEM SSKPFLPFGY GLSYTTFEYK NLTIEKEKIT MDESIKISVE
IENTGNYEGD EVVQLYTRKE EFLVTRPVKE LKAYKRVHLK PGEKKKVVFE IFPDQFAYYD
YDMNRVISPG TVEVMVGASS EDIKFTGTFE IVGEKKDAKE IKNYLSHAWC E