Gene Athe_0184 details


Gene Information

Locus tag: Athe_0184
Symbol:
ID: 7407175
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 231611
End bp: 233116
Gene Length: 1506 bp
Protein Length: 501 aa
Translation table: 11
GC content: 36%
IMG OID: 643714586
Product: Xylan 1,4-beta-xylosidase
Protein accession: YP_002572109
Protein GI: 222528227
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3664] Beta-xylosidase
TIGRFAM ID:
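
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and rests on two assumptions that the record does not state: that the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon. The function names `cds_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def cds_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_bp: int) -> int:
    """Residue count for a CDS, assuming its last codon is a stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


gene_bp = cds_length(231611, 233116)
print(gene_bp)                  # 1506
print(protein_length(gene_bp))  # 501
```

Under those assumptions, both values agree with the Gene Length and Protein Length fields listed above.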


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACATATG TAAAAATTGA ACGAGGAAAA ATATTTGGTG TATTTCCAGA TAATTGGAAA 
TTTTGTGTTG GTAGCGGTCG TATAGGGCTT GCGCTTCAAA AGGAGTATAT GGAAGCATTA
GAATATGTAA AAAAGCATAT TGATTTTAAA TATTTAAGAG CTCATGGTTT GCTTCATGAC
GATGTTGGAA TCTACCGCGA AGATAGTGTT GGGGATATGA AGCAGCCGTT TTACAATTTT
ACTTATATTG ATAAGATATA TGATTCATTT TTGGAGCTTG GAATACGGCC TTTTGTTGAG
ATAGGATTTA TGCCGTCAAA ACTTGCATCT GGAACACAAA CAGTATTTTA CTGGAGGGGT
AATGTTACCC CTCCCAGTGA TTATGGAAAG TGGGAGAAGC TAATTAAAGC AGTTGTTAAA
CACTTCATAG ACAGATATGG CGAAAAAGAG GTTGAAAACT GGCCGTTTGA AATATGGAAC
GAACCAAATT TAAATGTATT TTGGAAAGAT GCTGATCAAA ATGAATATTT TAAGCTATAT
GAAGTGACAG CAAAGGCTAT AAAAGATGTA AATGAGAATA TAAAGGTTGG TGGGCCTGCA
ATATGTGGCG GGGCAGACCA CTGGATAGAC GATTTTTTGA ATTTTTGTTA TAAAAATAAT
GTTTCTGTTG ATTTTGTTAC ACGACATGCG TATACAGCAA AACCCCCTAC TTATACACCA
CATTTTGTTT ATCACGATTT ACATCCAATT GATTACATGT TAAACGAATT TAAAATGGTA
CGAGAGCAGG TAAAAAATTC ACCGTTTCCA AATTTGCCGA TACATATTAC TGAATACAAC
AGTTCATACC ATCCGCTTTG CCCTGTTCAC GACACGCCGT TTAATGCGGC ATACCTTGCA
AGGATATTAA GCGAAGGAGG AGATTATGTA GATTCGTTTT CTTATTGGAC ATTCAGTGAT
GTATTTGAAG AAGCAGATGT ACCAAGGTCT CTGTTTCATG GTGGGTTTGG CCTTGTAGCT
TTCAATAATA TTCCAAAACC TGTGTTTCAC ATGTTTACAT TTTTCAATGC AATGGGAAGA
GATATTTTGT ATAGAGATGA CCATATCTTG GTAACAAAAA GAGCAGATGG TTCAGTTGCG
ATTGTGGCAT GGAATGAAGT TATAAGTAAA GAACAAGAGA TTGAAAGAGA ATACAAGCTG
GAAATTCCTA TTGACTTTGA GGATATTTTT GTAAAGCAAA AATTAATTGA CGAAGAACAT
GCAAATCCAT GGCGTGTATG GATTGAGATG GGAAGACCAA GGTATCCGTC AAAGGAACAG
ATAAAAACTT TGAAGGAAAT TGCAAAACCG TATGTTAGCA CTTGCAGAAT GAGAGCAAGA
GAGGGTTATG TAACACTTAA TATCAAGTTA GGTAAGAATG CAGTTGTGCT TTATGAGCTC
AATAAAGTTA ATGACGAGAC GCATACATAT ATAGGGCTTG ATGATAGTAA AATTCCAGGT
TATTGA
 
Protein sequence
MTYVKIERGK IFGVFPDNWK FCVGSGRIGL ALQKEYMEAL EYVKKHIDFK YLRAHGLLHD 
DVGIYREDSV GDMKQPFYNF TYIDKIYDSF LELGIRPFVE IGFMPSKLAS GTQTVFYWRG
NVTPPSDYGK WEKLIKAVVK HFIDRYGEKE VENWPFEIWN EPNLNVFWKD ADQNEYFKLY
EVTAKAIKDV NENIKVGGPA ICGGADHWID DFLNFCYKNN VSVDFVTRHA YTAKPPTYTP
HFVYHDLHPI DYMLNEFKMV REQVKNSPFP NLPIHITEYN SSYHPLCPVH DTPFNAAYLA
RILSEGGDYV DSFSYWTFSD VFEEADVPRS LFHGGFGLVA FNNIPKPVFH MFTFFNAMGR
DILYRDDHIL VTKRADGSVA IVAWNEVISK EQEIEREYKL EIPIDFEDIF VKQKLIDEEH
ANPWRVWIEM GRPRYPSKEQ IKTLKEIAKP YVSTCRMRAR EGYVTLNIKL GKNAVVLYEL
NKVNDETHTY IGLDDSKIPG Y
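
The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before their length or GC content can be checked against the record. The following is a minimal sketch of one way to do that. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the example uses only the first line of the gene sequence rather than the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence as printed in the record.
first_line = "ATGACATATG TAAAAATTGA ACGAGGAAAA ATATTTGGTG TATTTCCAGA TAATTGGAAA"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Applying the same two helpers to the full gene sequence would allow a comparison with the Gene Length and GC content fields, and `len(clean_sequence(...))` on the protein block would allow a comparison with the Protein Length field.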