Gene Athe_1857 details


Gene Information

Locus tag: Athe_1857
Symbol:
ID: 7408970
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 1938607
End bp: 1943043
Gene Length: 4437 bp
Protein Length: 1478 aa
Translation table: 11
GC content: 45%
IMG OID: 643716229
Product: glycoside hydrolase family 48
Protein accession: YP_002573718
Protein GI: 222529836
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3693] Beta-1,4-xylanase
TIGRFAM ID:
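
The coordinates and lengths in this record can be cross-checked with a few lines of arithmetic. The sketch below is illustrative only: the variable names are invented here, and it assumes that the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information record.
start_bp = 1938607
end_bp = 1943043
gene_length_bp = 4437
protein_length_aa = 1478

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that is not translated.
codon_count = span_bp // 3
expected_protein_aa = codon_count - 1

print(span_bp == gene_length_bp)                 # span matches Gene Length
print(span_bp % 3 == 0)                          # span is a whole number of codons
print(expected_protein_aa == protein_length_aa)  # matches Protein Length
```

Under these assumptions all three checks print `True`: the span is 4437 bp, which is 1479 codons, or 1478 residues plus the stop codon.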


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00499056
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATGAAGA AATTAGTTAA AATTATAACT CACGTGGTAT TGATTACATT TATAGCAGGT 
GTTTGTTTGT TTGGTACGAT GAGTTATTAT CCAATTGAAA CCAAAGCAGC ACCTGACTGG
AACATTCCAA GTTTATATGA GAGTTATAAG AATGATTTTA GGATAGGTGT AGCAATACCT
GCAAAATGTT TGAGCAACGA TACAGACAGA AGGATGGTAT TGAAACATTT TAACAGTATA
ACAGCAGAGA ATGAAATGAA GCCAGAAAGT TTATTAGCTG GGCAAACAAG TACAGGCTTA
AATTATAGAT TTAGCACTGC TGATACCTTT GTAGACTTTG CGAATACGAA CAATATAGGA
ATTAGAGGAC ATACTTTAGT GTGGCACAGC CAGACGCCTG ATTGGTTTTT TAAAGACAGC
AGTGGACAAA GGTTAACCAA GGATGCGTTG TTAGCAAGAT TGAAGCAATA TATATATGAT
GTTGTAGGAA GGTATAAGGG AAAAGTATAC GCATGGGACG TTGTAAATGA AGCCATTGAT
GAGAATCAGT CTGATGGTTA TAGGCGGTCG ACATGGTATG AGATTTGTGG ACCAGAATAC
ATTGAGAAAG CATTTATATG GGCACATGAA GCAGATCCCA ACGCAAAGCT ATTTTATAAT
GACTATAATA CTGAGATTTC CAAGAAAAGA GATTTTATAT ACAACATGGT TAAGAATTTA
AAATCAAAAG GAATACCAAT ACATGGTATT GGAATGCAAT GCCATATAAA TGTTAACTGG
CCTTCAGTTA GCGAAATAGA GAACAGCATA AAGTTATTCA GTTCAATACC GGGCATAGAA
ATTCATATAA CAGAGCTTGA TATGAGTTTA TACAATTACG GATCAAGCGA AAACTATTCG
ACACCCCCAC AGGACTTACT TCAGAAGCAA GCTCAGAAGT ACAAAGAATT ATTTACAATG
TTAAAGAAGT ATACGAATGT TGTTAAATGT GTAACATTTT GGGGATTAAA AGATGATTAT
TCATGGTTAA GATCTTTTAA CGGCAAAAAT GATTGGCCAT TATTGTTTTT TGAGGATTAC
AGTGCAAAAC CAGCTTATTG GGCAGTAATA GAAGCATCAG GGACATCAAC AACACCAGCT
CCAACTACAA CTATTACGCC GACCCCAACA CCAACACCAA CACTGACTCC AACACCGACA
CCTACACCAA CACCAACGTC AACACCAACT GCTACACCAA CAGCAACGCC AACACCAACA
CCGACGCCGA GCAGCACACC TGTAGCAGGT GGACAGATAA AGGTATTGTA TGCTAACAAG
GAGACAAATA GCACAACAAA CACGATAAGG CCATGGTTGA AGGTAGTGAA CACTGGAAGC
AGCAGCATAG ATTTGAGCAG GGTAACGATA AGGTACTGGT ACACGGTAGA TGGGGACAAG
GCACAGAGTG CGATATCAGA CTGGGCACAG ATAGGAGCAA GCAATGTGAC ATTCAAGTTT
GTGAAGCTGA GCAGTAGCGT AAGTGGAGCG GACTATTATT TAGAGATAGG ATTTAAGAGT
GGAGCTGGGC AGTTGCAGGC TGGTAAAGAC ACAGGGGAGA TACAGATAAG GTTTAACAAG
AGTGACTGGA GCAATTACAA TCAGGGGAAT GACTGGTCAT GGATGCAGAG CATGACGAAT
TATGGAGAGA ATGTGAAGGT AACAGCGTAT ATAGATGGTG TATTGGTATG GGGACAGGAG
CCGAGTGGAG CGACACCAAC ACCGACAGCA ACACCAGCAC CGACAGTGAC ACCGACACCT
ACACCAACAC CAACGTCAAC ACCAACTGCT ACACCAACAG CAACGCCAAC ACCAACACCG
ACGCCGAGCA GCACACCTGT AGCAGGTGGA CAGATAAAGG TATTGTATGC TAACAAGGAG
ACAAATAGCA CAACAAACAC GATAAGGCCA TGGTTGAAGG TAGTGAACAC TGGAAGCAGC
AGCATAGATT TGAGCAGGGT AACGATAAGG TACTGGTACA CGGTAGATGG GGACAAGGCA
CAGAGTGCGA TATCAGACTG GGCACAGATA GGAGCAAGCA ATGTGACATT CAAGTTTGTG
AAGCTGAGCA GTAGCGTAAG TGGAGCGGAC TATTATTTAG AGATAGGATT TAAGAGTGGA
GCTGGGCAGT TGCAGGCTGG TAAAGACACA GGGGAGATAC AGATAAGGTT TAACAAGAGT
GACTGGAGCA ATTACAATCA GGGGAATGAC TGGTCATGGA TGCAGAGCAT GACGAGTTAT
GGAGAGAATG TGAAGGTAAC AGCGTATATA GATGGTGTAT TGGTATGGGG ACAGGAGCCG
AGTGGAGCGA CACCAACACC GACAGCAACA CCAGCACCGA CAGTGACACC GACACCCACA
CCAGCACCAA CTCCAACCCC GACCCCAACA CCAACTGCTA CACCAACAGC AACGCCAACA
CCAACACCGA CGCCAACACC AACCCCAACC GCGACACCAA CAGTAACAGC AACACCAACA
CCGACGCCGA GCAGCACACC GAGTGTGCTT GGCGAATATG GGCAGAGGTT TATGTGGTTA
TGGAACAAGA TACATGATCC TGCGAACGGG TATTTTAACC AGGATGGGAT ACCATATCAT
TCGGTAGAGA CATTGATATG CGAAGCACCT GATTATGGTC ATTTGACCAC GAGTGAGGCA
TTTTCGTACT ATGTATGGTT AGAGGCAGTG TATGGTAAGT TAACGGGTGA CTGGAGCAAA
TTTAAGACAG CATGGGACAC ATTAGAGAAG TATATGATAC CATCAGCGGA AGATCAGCCG
ATGAGGTCAT ATGATCCTAA CAAGCCAGCG ACATACGCAG GGGAGTGGGA GACACCGGAC
AAGTATCCAT CGCCGTTGGA GTTTAATGTA CCTGTTGGCA AAGACCCGTT GCATAATGAA
CTTGTGAGCA CATATGGTAG CACATTAATG TATGGTATGC ACTGGTTGAT GGACGTAGAC
AACTGGTATG GATATGGCAA GAGAGGGGAC GGAGTAAGTC GGGCATCATT TATCAACACG
TTCCAGAGAG GGCCTGAGGA GTCTGTATGG GAGACGGTGC CGCATCCGAG CTGGGAGGAA
TTCAAGTGGG GCGGACCGAA TGGATTTTTA GATTTGTTTA TTAAGGATCA GAACTATTCG
AAGCAGTGGA GATATACGGA TGCACCAGAT GCTGATGCGA GAGCTATTCA GGCTACTTAT
TGGGCGAAAG TATGGGCGAA GGAGCAAGGT AAGTTTAATG AGATAAGCAG CTATGTAGCG
AAGGCAGCGA AGATGGGAGA CTATTTAAGG TATGCGATGT TTGACAAGTA TTTCAAGCCA
TTAGGATGTC AGGATAAGAA TGCGGCTGGA GGAACGGGGT ATGACAGTGC ACATTATCTG
CTATCATGGT ATTATGCATG GGGTGGAGCA TTGGATGGAG CATGGTCATG GAAGATAGGG
AGCAGCCATG TGCACTTTGG ATATCAGAAT CCGATGGCGG CATGGGCATT AGCGAATGAT
AGTGATATGA AGCCGAAGTC GCCGAATGGA GCGAGTGACT GGGCAAAGAG TTTGAAGAGG
CAGATAGAAT TTTACAGGTG GTTACAGTCA GCGGAGGGAG CGATAGCAGG AGGCGCGACA
AATTCATGGA ATGGCAGATA TGAGAAGTAT CCAGCAGGGA CAGCAACATT TTATGGAATG
GCATATGAAC CGAATCCGGT ATATCATGAT CCTGGGAGCA ACACATGGTT TGGATTCCAG
GCATGGTCGA TGCAGAGGGT AGCGGAGTAT TACTATGTGA CAGGAGATAA GGACGCAGGA
GCACTGCTTG AGAAGTGGGT AAGCTGGGTT AAGAGTGTAG TGAAGTTGAA TAGTGATGGT
ACGTTTGCGA TACCGTCGAC GCTTGATTGG AGCGGACAAC CTGATACATG GAACGGGGCG
TATACAGGGA ATAGCAACTT ACATGTTAAG GTAGTGGACT ATGGTACTGA CTTAGGAATA
ACAGCGTCAT TGGCGAATGC GTTGTTGTAC TATAGTGCAG GGACGAAGAA GTATGGGGTA
TTTGATGAGG GAGCGAAGAA TTTAGCGAAG GAATTGCTGG ACAGGATGTG GAAGTTGTAC
AGGGATGAGA AGGGATTGTC AGCGCCAGAG AAGAGAGCGG ACTACAAGAG GTTCTTTGAG
CAAGAGGTAT ATATACCGGC AGGATGGATA GGGAAGATGC CGAATGGAGA TGTAATAAAG
AGTGGAGTTA AGTTTATAGA CATAAGGAGC AAGTATAAAC AAGATCCTGA TTGGCCGAAG
TTAGAGGCGG CATACAAGTC AGGGCAGGCA CCTGAGTTCA GATATCACAG GTTCTGGGCA
CAGTGCGACA TAGCAATAGC TAATGCAACA TATGAAATAC TGTTTGGCAA TCAATAA
 
Protein sequence
MMKKLVKIIT HVVLITFIAG VCLFGTMSYY PIETKAAPDW NIPSLYESYK NDFRIGVAIP 
AKCLSNDTDR RMVLKHFNSI TAENEMKPES LLAGQTSTGL NYRFSTADTF VDFANTNNIG
IRGHTLVWHS QTPDWFFKDS SGQRLTKDAL LARLKQYIYD VVGRYKGKVY AWDVVNEAID
ENQSDGYRRS TWYEICGPEY IEKAFIWAHE ADPNAKLFYN DYNTEISKKR DFIYNMVKNL
KSKGIPIHGI GMQCHINVNW PSVSEIENSI KLFSSIPGIE IHITELDMSL YNYGSSENYS
TPPQDLLQKQ AQKYKELFTM LKKYTNVVKC VTFWGLKDDY SWLRSFNGKN DWPLLFFEDY
SAKPAYWAVI EASGTSTTPA PTTTITPTPT PTPTLTPTPT PTPTPTSTPT ATPTATPTPT
PTPSSTPVAG GQIKVLYANK ETNSTTNTIR PWLKVVNTGS SSIDLSRVTI RYWYTVDGDK
AQSAISDWAQ IGASNVTFKF VKLSSSVSGA DYYLEIGFKS GAGQLQAGKD TGEIQIRFNK
SDWSNYNQGN DWSWMQSMTN YGENVKVTAY IDGVLVWGQE PSGATPTPTA TPAPTVTPTP
TPTPTSTPTA TPTATPTPTP TPSSTPVAGG QIKVLYANKE TNSTTNTIRP WLKVVNTGSS
SIDLSRVTIR YWYTVDGDKA QSAISDWAQI GASNVTFKFV KLSSSVSGAD YYLEIGFKSG
AGQLQAGKDT GEIQIRFNKS DWSNYNQGND WSWMQSMTSY GENVKVTAYI DGVLVWGQEP
SGATPTPTAT PAPTVTPTPT PAPTPTPTPT PTATPTATPT PTPTPTPTPT ATPTVTATPT
PTPSSTPSVL GEYGQRFMWL WNKIHDPANG YFNQDGIPYH SVETLICEAP DYGHLTTSEA
FSYYVWLEAV YGKLTGDWSK FKTAWDTLEK YMIPSAEDQP MRSYDPNKPA TYAGEWETPD
KYPSPLEFNV PVGKDPLHNE LVSTYGSTLM YGMHWLMDVD NWYGYGKRGD GVSRASFINT
FQRGPEESVW ETVPHPSWEE FKWGGPNGFL DLFIKDQNYS KQWRYTDAPD ADARAIQATY
WAKVWAKEQG KFNEISSYVA KAAKMGDYLR YAMFDKYFKP LGCQDKNAAG GTGYDSAHYL
LSWYYAWGGA LDGAWSWKIG SSHVHFGYQN PMAAWALAND SDMKPKSPNG ASDWAKSLKR
QIEFYRWLQS AEGAIAGGAT NSWNGRYEKY PAGTATFYGM AYEPNPVYHD PGSNTWFGFQ
AWSMQRVAEY YYVTGDKDAG ALLEKWVSWV KSVVKLNSDG TFAIPSTLDW SGQPDTWNGA
YTGNSNLHVK VVDYGTDLGI TASLANALLY YSAGTKKYGV FDEGAKNLAK ELLDRMWKLY
RDEKGLSAPE KRADYKRFFE QEVYIPAGWI GKMPNGDVIK SGVKFIDIRS KYKQDPDWPK
LEAAYKSGQA PEFRYHRFWA QCDIAIANAT YEILFGNQ
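
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It is not the tool that produced this record. The `translate` function and the constant names are hypothetical, and the codon assignments are those shared by NCBI translation tables 1 and 11, which differ only in the codons permitted as start codons.

```python
# Codon table in NCBI order (bases T, C, A, G at each position).
# Tables 1 and 11 share these assignments; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(dna: str) -> str:
    """Translate DNA in frame 1, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, with the 10-base grouping left in place.
print(translate("ATGATGAAGA AATTAGTTAA AATTATAACT"))  # MMKKLVKIIT
```

The output matches the first ten residues of the protein sequence listed above. Because whitespace is stripped before translation, the sequence blocks can be pasted in as they appear in the record.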