Gene Athe_0183 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_0183 
Symbol 
ID7407174 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp229475 
End bp231568 
Gene Length2094 bp 
Protein Length697 aa 
Translation table11 
GC content34% 
IMG OID643714585 
ProductCellulose 1,4-beta-cellobiosidase 
Protein accessionYP_002572108 
Protein GI222528226 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3693] Beta-1,4-xylanase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAA GGAAATTCAA AATATTATAT TTATTTTTAA TTATAGTACT TTCTGTATCA 
TTTATTATAT CAATAGTTTT TCCATCATTT TTTAAGGCGG CACAGACAAC CTCAACAAAC
ATAAACTTTG AAGGAAGAGA CAAGTTAACA TTTTTTGCAT ATGGCAAAGC AAAAATAACA
ATAGACCAAA ACATAGCACA AGAAGGAAAA AAGAGTATAA AAGTTACAGA CAGGAAAAGT
GTATGGGATA GCTTTGGGAT AGATGTAAAA GATGTTTTAC AAAGAGGAAA AACATGGGTG
GTATCAGCCT ATGTAAAACA TAAGGGGAAG AAGCCGATAG AATTTTCAAT AACAGCTATT
TATAATGACG GCAGGGGGTT AAAGTACCTT CAGCTTGGTG AGAAAATTGT CATACCAAAC
AAATGGGACA AAATTGTTGC TAAGTGGAAA CCAACGTTAA AAAACCCGAT GGACTTGATT
ATTGCAATTC ATCCAACAGT TGATAAAACA ACTGCATATA ATGTGGACAA TATTCAAATA
ATGACAGAAG AAGTTTATCA ATCACAAGCT GTTGTTTTTA AAGATACATT TGAATCAAAT
TTGACAAACT GGCAGCCAAG AGGTGATACT GTAAAACTAA AAATAGATAA TACAAAATCG
CATAATGGAA ATAAGAGTCT TTATGTATCA GGTCGTTCGG CATTCTGGCA TGGAGTTCAA
ATTCCTGTGA CAAAATATCT TGTTGCTGGG AAGGTATACA AATTTAGCGT ATGGCTGTAT
CATCAATCAA TTGACAAGCA AGGTTTTGGT CTTACCATTC AAAGAAAGAT GGCAAACGAT
GAACAATATA AATATGATTG GATAACTGGA AGCCAGATTG AAGGTGATGG CTGGGTTGAG
ATAAGTGGTA ATTATTATGT ACCAAAGGAT GGCAAAATAG AAGAACTTGT ATTTTGTGTT
TCTTCGTGGA ACCCAACATT AGCATTTTGG GTAGATGATG TTACAATATC TGATCCGTTT
AAGTTACAGG GACCTAATTA TAATTTGCCG TCTTTAAAAG AGAAATATAA AGAAGATTTT
AAAGTTGGTG TAGCTATTGG ATATGGTGAA CTTATTAGTG ATATAGACAC ACAATTTATC
AAAAAACATT TTAACAGTAT AACACCAGGC AACGAGATGA AACCCGAAAG TGTGCTAAAA
GGACCAAACA ACTATGACTT TACAATAGCG GATGCATTTG TGGATTTTGC AACAAAAAAT
AAAATGGGTA TACGCGGACA TACTCTTGTC TGGCACAACC AGACACCTGA TTGGTTCTTC
AAAGATGAGA ATGGCAATTT TTTAAAGAAG GATGAACTTT TGAAAAGGTT AAAAAATCAT
ATATACACAG TTGTTAGCCG GTATAAAGGC AAAATATATG CTTGGGATGT TGTCAATGAA
GCTATTGATG AAACACAACC TGATGGTTAC AGAAGGTCAA ACTGGTACAA TATTTGTGGA
CCCGAATATA TAGAAAAAGC GTTTATTTGG GCACATGAGG CAGATCCACA AGCAAAGTTA
TTTTACAATG ATTACAATAC CGAAATTCCA CAAAAGAGAA TGTTTATATA TAACATGATT
AAAAATTTGA AAGCAAAAGG TGTTCCAATA CATGGTATAG GTCTTCAATG TCACATAAAT
ATTGACAATC CTTCTGTTGA AGATATAGAG GAGACGATAA AACTATTTAG CACAATTCCA
GGGCTTGAGA TTCAAATTAC TGAGCTTGAC ATGAGCTTTT ATCAATGGGG TTCTTCTGTT
TATTACGCAG AGCCATCAAG AGAAATGTTA TTAAAACAGG CAAAGAAATA CTATGAGTTA
TTTAACCTAT TTAAGAAGTA CAAAAATGTC ATAAAAAGCG TTACATTCTG GGGGCTTAAG
GATGACAACT CTTGGCTGAG AGGAGTTTTT AACAAACCAG ATTTTCCGCT TTTATTTGAT
GAGCATTATG ATGGCAAACC TGCTTTCTGG GCGTTGATAG ACTATTCAAT ATTACCACAA
AATGCCAATT TGCCTACACC ACCTGCTATT CCAAAAGTAA AGGCTAAAAA ATAA
 
Protein sequence
MKKRKFKILY LFLIIVLSVS FIISIVFPSF FKAAQTTSTN INFEGRDKLT FFAYGKAKIT 
IDQNIAQEGK KSIKVTDRKS VWDSFGIDVK DVLQRGKTWV VSAYVKHKGK KPIEFSITAI
YNDGRGLKYL QLGEKIVIPN KWDKIVAKWK PTLKNPMDLI IAIHPTVDKT TAYNVDNIQI
MTEEVYQSQA VVFKDTFESN LTNWQPRGDT VKLKIDNTKS HNGNKSLYVS GRSAFWHGVQ
IPVTKYLVAG KVYKFSVWLY HQSIDKQGFG LTIQRKMAND EQYKYDWITG SQIEGDGWVE
ISGNYYVPKD GKIEELVFCV SSWNPTLAFW VDDVTISDPF KLQGPNYNLP SLKEKYKEDF
KVGVAIGYGE LISDIDTQFI KKHFNSITPG NEMKPESVLK GPNNYDFTIA DAFVDFATKN
KMGIRGHTLV WHNQTPDWFF KDENGNFLKK DELLKRLKNH IYTVVSRYKG KIYAWDVVNE
AIDETQPDGY RRSNWYNICG PEYIEKAFIW AHEADPQAKL FYNDYNTEIP QKRMFIYNMI
KNLKAKGVPI HGIGLQCHIN IDNPSVEDIE ETIKLFSTIP GLEIQITELD MSFYQWGSSV
YYAEPSREML LKQAKKYYEL FNLFKKYKNV IKSVTFWGLK DDNSWLRGVF NKPDFPLLFD
EHYDGKPAFW ALIDYSILPQ NANLPTPPAI PKVKAKK