Gene Athe_0187 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_0187 
Symbol 
ID7407178 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp235084 
End bp236250 
Gene Length1167 bp 
Protein Length388 aa 
Translation table11 
GC content37% 
IMG OID643714589 
Productglycoside hydrolase family 39 
Protein accessionYP_002572112 
Protein GI222528230 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3664] Beta-xylosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAATAA ATATTTATCC AGAAATTAAG TTAGGCAAGA TAAATAAATT CTGGACAAAG 
TGTGTAGGCA GCTGTCATGC AGCAACTGCC CTGAGGGAGG ATTGGAGAAA GCAGCTGAAA
AAGTGTAGGC AAGAACTTGG GTTTAAATAT GTCAGATTTC ATGGCTGGCT TAACGATGAT
ATGAGTGTTT GCTTCAGAAA TGATGAGGGC AAACTTTGGT TTTCATTTTT CAACATTGAT
TCTATTATTG ACTTTTTATT GGATATTGGA ATGAAACCTT TTATAGAATT GAGCTTTATG
CCTGAAGCTT TAGCATCTGG TACAAAGACA GTTTTTCATT ATAAGGGCAA TATAACTCCG
CCAAAATCCT ATGAAGAGTG GGGACAACTT GTTAAAGAAC TTACAAGACA TCTTGTAAAA
AGATATGGTA AAAATGAGGT AAGAGAGTGG TTTTTTGAGG TATGGAACGA ACCGAATTTG
AAGGACTTTT TCTGGGCAGG GACAATGGAG GAGTATTTTG AGCTTTATAA GCATGCAGCG
TTTGCAATCA AGATAGTTGA CTCTGAACTG AAAGTAGGAG GACCTGCATC AGCAGTAGAT
GCGTGGATTT TGGAGCTGAA AGAGTATTGT CAGAAAAATG GAGTACCAAT AGATTTTATA
ACAACACACC AGTATCCAAC AGATTTAGCG TTTAGCACAA GTTCTAATAT GGAAGAAGCC
ATGGCAAAAG CAAAAAGAGG TGAGCTTGCT GAAAGAGTAA AAAAGGCTTT GAGTGAAGCA
GAACCTTTGC CTTTGTATTA TACTGAGTGG AATAACTCGC CGAGTCCGCG AGACCCGTAC
CACGATATTC CATATGATGC AGCATTTATT GTGAAGACTA TAATAGACAT TATTGATTTG
CCGCTTGGTT GTTACTCATA CTGGACATTC ACAGATATAT TTGAAGAGTG TGGGCTAAGT
TCATTGCCTT TCCATGGAGG GTTTGGACTT TTAAATATTC ACGGTATACC AAAGCCGTCA
TATAGAGCAT TTCAAATATT GAATAAATTA GATGGAGAAA AGGTAGAACT TAAAGTTGAA
GAAAAAAGTC CAACTGTAGA CTGCATAGCA GCTATAAATG GCAATGAATT AGTTCTTGTT
ATTTCAAACC ATAATGTTCC GCCTTAG
 
Protein sequence
MKINIYPEIK LGKINKFWTK CVGSCHAATA LREDWRKQLK KCRQELGFKY VRFHGWLNDD 
MSVCFRNDEG KLWFSFFNID SIIDFLLDIG MKPFIELSFM PEALASGTKT VFHYKGNITP
PKSYEEWGQL VKELTRHLVK RYGKNEVREW FFEVWNEPNL KDFFWAGTME EYFELYKHAA
FAIKIVDSEL KVGGPASAVD AWILELKEYC QKNGVPIDFI TTHQYPTDLA FSTSSNMEEA
MAKAKRGELA ERVKKALSEA EPLPLYYTEW NNSPSPRDPY HDIPYDAAFI VKTIIDIIDL
PLGCYSYWTF TDIFEECGLS SLPFHGGFGL LNIHGIPKPS YRAFQILNKL DGEKVELKVE
EKSPTVDCIA AINGNELVLV ISNHNVPP