Gene Athe_2057 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_2057 
Symbol 
ID7408270 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp2172903 
End bp2175221 
Gene Length2319 bp 
Protein Length772 aa 
Translation table11 
GC content38% 
IMG OID643716424 
Productalpha-xylosidase YicI 
Protein accessionYP_002573907 
Protein GI222530025 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1501] Alpha-glucosidases, family 31 of glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAATTTA CAGACGGTTT TTGGCGTGTA AAAGATGGAA TAAGATTATA TCATCCAGCC 
CATATATATG ATTACGAAAT TTCGAAAGAC TCAACCACAA TTATTGCGCC AGCTCAATTT
ATTACAAACA GAGGACAAAC CTTACAAGGT CCTGTTTTCA CTATACGTTT TTCTTCACCT
TTTGAAGATG TTATAAGAGT GCAAATTTGG CACTACAAGG GTCAAAAAGA TAAAAAGCCA
TATTTTGAAT TTTATAAAGA AGAAGGATAT TGCCCTTTGA TAGAAGTTTT TTCGGAGAGT
ATAGTAATAA CAAGTGGAAA GCTAAAAGCT GTTATTAATA GAAAAGGTGA ATGGAAAGTA
GCATATTACT ACGAAGATAA ATATCTAACA AGAAATGGTT ATAAATATCT TGGTTACGCA
ATCATGCCTG ATAATACTAC TTACATGAGG GAACAGCTTT CTTTGAGTGT TGGAGAGTGT
GTTTACGGGT TGGGCGAGAG GTTTACTCCT TTTGTTAAAA ACGGACAAAT GATTGATATG
TGGAACGAAG ATGGTGGTAC GAACTCTGAT CTTGCATACA AAAACATTCC TTTTTACATT
ACAAACCGTG GATATGGTGT TTTTGTAAAT GACCCAGGAC GAGTGTCATT TGAAGTAGCC
ACAGAGAATG TCGAGAGAGT TCAGTTTTCT GTGGAAGGTG AATATTTGGA ATATTTCATA
ATTGGCGGTA GCAACATGAA AAATGTTTTA GAAAATTACA CAAAACTCAC AGGTCGGCCA
CAGCTTCCTC CAGCATGGTC TTTTGGACTT TGGCTTACAA CCTCTTTTAC AACAAGCTAT
GATGAAAAGA CTGTTACAAA CTTTATAGAT GGAATGATTG AAAGGGATAT TCCACTTCAT
GTGTTTCATT TTGACTGTTT CTGGATGAAA GATATGCACT GGGTTGATTT TGAGTGGGAC
AGAAGGGTTT TTCTTGAACC ATCACAGATG CTAAAGCGTC TAAAAGAAAA GGGAGTAAAA
ATATGTGTTT GGATAAATCC CTATATATCT CAGCTTTCTA AACTGTTTGA CGAAGGCAAA
GAAAAAGGGT ATTTTTTGAA AAAGCCAAAT GGTGATGTAT GGCAGACAGA TGATTGGCAG
CCTGGTATGG CAATTGTTGA TTTTACAAAC CCTGAGGCGT GCAGGTGGTA TTCAGAAAAG
CTCAAAGAGC TAATTAAAAT GGGAGTTGAC TGTTTTAAGA CAGATTTTGG TGAAAGAATT
CCAACAGATG TTGTTTATTT TGATGGTTCA GACCCTCAAA AGATGCACAA TTACTACACC
TATCTTTACA ACAAGACAGT ATATGAGACG CTTCAAGAAA CGTTTGGCAA GGGAAATGCA
GTTGTTTTTG CAAGGTCAGC GACAGTAGGA AGCCAGAAAT TTCCTGTGCA CTGGGGCGGA
GACTGTTTAG CTTCATATGA GTCCATGGCA GAGACACTCA GGGGTGGCCT TTCACTTTCA
CTTTGCGGGT TTGGTTTTTG GAGTCATGAC ATAGGGGGGT TTGAGAGTAC AGCAACACCA
GATCTTTACA AGAGATGGGT AGCATTTGGA CTTTTATCTT CTCACAGCAG ACTTCATGGA
AATTCTGCCT ATAAAGTTCC ATGGCTTTAT GACGAAGAGG CGGTTGACGT ACTTAGGTTC
TTTACAAAAT TAAAATGTAA ACTTATGCCA TACATCTTTT CAGCGGCTGT AGAGGCAACA
GAAAGAGGGA TTCCAGTCTT GAGGCCAATG GTCTTAGAGT TTCCGGACGA TCCTGCTTGT
CTTTATCTTG ACAGGCAATA TATGCTTGGA GACAGTCTTT TGGTTGCACC AATCTTTTCA
GAAGATGGAT ATGTTGAGTA TTATGTGCCA GAAGGGATTT GGACAAATAT CCTGACAGGT
GAAAAAGTTG AGGGTGGCAA GTGGAGAAAA GAAAAGCACG GCTATTTTAG CCTTCCACTT
TTAGCAAGGC CAAATACTGT AATCCCAATG GGAAGTGTAG ACACAAAGCC CGATTATGAT
TATGCTGATA ATGTGGCGAT GAATATTTAT CATATTGATA GTGGGGAGAC ATTGAAATCT
CAGATAAGAA ATGTAGAAGG TAAGGCAGAA ATTGAAATTG AAGTGAGAAG ACACGGAGAT
GTTATTTATG TCACAAATAT AAGAGATTCA AAAAAAACAT GGAGTCTATA TTTTGATTCT
CTAAGGATAG AAGTTATATC TGGAGCAAGT GTAAAGGTCG ACAGTAATGG TAGTAAGATA
AATGTATATT CTGACACAGC TGTGTTAAAA GTGATTTAA
 
Protein sequence
MKFTDGFWRV KDGIRLYHPA HIYDYEISKD STTIIAPAQF ITNRGQTLQG PVFTIRFSSP 
FEDVIRVQIW HYKGQKDKKP YFEFYKEEGY CPLIEVFSES IVITSGKLKA VINRKGEWKV
AYYYEDKYLT RNGYKYLGYA IMPDNTTYMR EQLSLSVGEC VYGLGERFTP FVKNGQMIDM
WNEDGGTNSD LAYKNIPFYI TNRGYGVFVN DPGRVSFEVA TENVERVQFS VEGEYLEYFI
IGGSNMKNVL ENYTKLTGRP QLPPAWSFGL WLTTSFTTSY DEKTVTNFID GMIERDIPLH
VFHFDCFWMK DMHWVDFEWD RRVFLEPSQM LKRLKEKGVK ICVWINPYIS QLSKLFDEGK
EKGYFLKKPN GDVWQTDDWQ PGMAIVDFTN PEACRWYSEK LKELIKMGVD CFKTDFGERI
PTDVVYFDGS DPQKMHNYYT YLYNKTVYET LQETFGKGNA VVFARSATVG SQKFPVHWGG
DCLASYESMA ETLRGGLSLS LCGFGFWSHD IGGFESTATP DLYKRWVAFG LLSSHSRLHG
NSAYKVPWLY DEEAVDVLRF FTKLKCKLMP YIFSAAVEAT ERGIPVLRPM VLEFPDDPAC
LYLDRQYMLG DSLLVAPIFS EDGYVEYYVP EGIWTNILTG EKVEGGKWRK EKHGYFSLPL
LARPNTVIPM GSVDTKPDYD YADNVAMNIY HIDSGETLKS QIRNVEGKAE IEIEVRRHGD
VIYVTNIRDS KKTWSLYFDS LRIEVISGAS VKVDSNGSKI NVYSDTAVLK VI