Gene Athe_1860 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_1860 
Symbol 
ID7408973 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp1947681 
End bp1953395 
Gene Length5715 bp 
Protein Length1904 aa 
Translation table11 
GC content44% 
IMG OID643716232 
Productglycoside hydrolase family 48 
Protein accessionYP_002573721 
Protein GI222529839 
COG category[R] General function prediction only 
COG ID[COG3889] Predicted solute binding protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.738055 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGAAGGG GTATAGCGTT ATTTATTGCA ATTATCTTTT TATGTACAGT AATGTATAGC 
TTAACTTTTC CAGTGAAGGC AGAGAGTATA GTCACCACTC AAAAATACAT TTGGAAAAAT
GTAAAAATTG AAGGTGGCGG CGGTTTTATT ACGGGAATAG TTTTTAATCC TAAAGAAAAA
AACTTAGTTT ATGTAAGAAC AGATATTGGC GGTGCTTACA GAAGTACAGA TGGTGGCAAT
ACGTGGACAC AGCTTATGAA TTGGGTTAGT TTTGATGAGT GGAATTTATT AGGCGTAGAA
AGTATAGCAA CAGATCCTGT TGATCCAAAT CGACTCTACA TTGCTGCTGG TACTTACACT
AATAGTTGGA CAAATATGAA TGGGGTCCTT TTACGTTCAA AAGATAAAGG TAATACATTT
GAACGTACTC CTTTACCATT CAAATTGGGC GGTAATATGC CTGGTCGAAA TATGGGAGAG
AGATTAGCTA TTGATCCGAA TCACAATAAT ATCTTATATT TAGGTACACG CGAAGGAAAT
GGACTATGGA AAAGTATAGA TTATGGCGTA ACGTGGAAAA AAGTAGAGAA CTTCCCGAAC
CCAGGAACAT ATGTTGAAGA CCCGAATTGT CCTCATGATT ATTTGAATCA TATTACAGGT
GTAGTATGGG TTGTATTTGA TCCAACGAGC GGTAAACCCG GAGAAGGAAG CAAAGTGATT
TATGTAGGGG TAGCTGATAA AAATACAAGC ATATACTATA CAAAAGATGG TGGGCAAACA
TGGGAGGCAC TACCAGGACA ACCTACAGGT TTACTTCCAC AACATGCAAA GTTAAGCTCA
GATGGGATGC TGTATATAAC ATATAGCAAT ACTCAGGGCC CTTACAACGG TGACTATGGA
GAAGTATGGA GATATAATAC TAAAACTGGC GAATGGAAGA ACATAAGTCC AATGCCAGCA
AAAGATACTT ATTTTGGATA TGGAGGATTA GCTGTTGATG CTCAAAATCC AAAGGTGGTA
ATGGTAGCAG CTTTGAGTTC ATGGTGGCCT GATACTTATA TTTGGCGTAG TACAGATGGC
GGTGAAACAT GGAAGTGTAT ATGGGAATGG GCTGGATATC CTAACAGAAC GCTTCACTAT
ACTATGGATA TTTCTGCAGC TCCATGGTTG AATTTTGGTA TTACGAATCC AATTCCTCCC
GAGACAAGTC CAAAGCTTGG TTGGATGGTT GCAGCCCTTG AGATTGATCC ATTTAACTCT
GATAGAATGT TATATGGGAC AGGTGCTACG TTATATGGGT GTGATGATTT AACTAAATGG
GATAAAGGTG AAAAGATAAC AATTAAAGTA AAAGGAATTG GAATAGAAGA AACATCAGTT
GCAGCATTAG TAAGCCCACC AATCGGACCA CATTTATTTA GTGCAATATA CGATATTGCT
GGGTTTAGAC ATGATGATTT GGAGAAAGCA CCCAACTGGA CATATACTCA ACCCAATATG
GGTTCGACTA CTGATATAGA TTACGCTGAA TTGAATCCAA ACTTTATGGT GCGTGTAGGG
AATGTTGACA AAACATGGAA TCCAAACACA AATAGAATTG GTTTTTCGTA TGATGGTGGG
AAATCATGGT TCCAAGGTAA TCAAGAACCT CAAGGAATGA CAGAAGGTGG AACAGTTGCT
GCAGCTGCTG ATGCTAGTGT AGTTGTGTGG GCACCAAAAG GAGCTCCTGT ATCCTATTCC
ACAGACAATG GTAATACATG GTATCAATGT GCCAATGTAC CTTCCGGAGC GATTGTATAC
TCTGATAGAG TTAATCCAAA CAAGTTTTAT GCTTTTAAAG ATGGAAAATT CTACATTAGT
AGTGACAAAG GAAAAACCTT TGTTCAATCA CCAGCTTCTG GCTTGCCAAC AAGCGGGAAC
TTTAAAGCTG TTCCTGGTCG TGAAGGTGAT ATATGGTTAG TCGGCAATAA TGGAATGTGG
CATTCGGTTG ATGGTGGTTA TACGTTTACA AAAATTGGAA ATGTAGAAGA AGCAGCAAGT
ATTGGGTTTG GTAAACCTGC AGAGGGAAAA ACATATCCGG CAATCTACAC TTATGCAAAG
ATAAATGGAA TCAGAGGAAT ATTTAGATCT GATGATGCAG GTGCAAGTTG GGTAAGAATA
AATGATGATA ATAATCAATT TGGTTGTGCT AATGCAGACA TTACTGGAGA CCAAAGAGTA
TATGGAAGAG TATTCGTTGC TACTAATGGT TTAGGAATAA AATGGGGTGA AATTGCTGAT
AGTGCTACAA TTCCAACATC AACGCCAGCA CCAACAGTAA CTGCTACACT GACCCCAACA
CCAACTGCTA CACCAACGCC AACACCAACA CCAACACCAA CACCAACATC AACTGCTACA
CCAACACCGA CACCTACACC AACACCAACG TCAACACCAA CTGCTACACC AACAGCAACG
CCAACACCAA CACCGACGCC GAGCAGCACA CCTGTAGCAG GTGGACAGAT AAAGGTATTG
TATGCTAACA AGGAGACAAA TAGCACAACA AATACGATAA GGCCATGGTT GAAGGTAGTG
AACACTGGAA GTAGCAGCAT AGATTTGAGC AGGGTAACGA TAAGGTACTG GTACACGGTA
GATGGGGACA AGGCACAGAG TGCGATATCA GACTGGGCAC AGATAGGAGC AAGCAATGTG
ACATTCAAGT TTGTGAAGCT GAGCAGTAGC GTAAGTGGAG CGGACTATTA TTTAGAGATA
GGATTTAAGA GTGGAGCTGG GCAGTTGCAG GCTGGTAAAG ACACAGGGGA GATACAGATA
AGGTTTAACA AGGATGACTG GAGCAATTAC AATCAGGGGA ATGACTGGTC ATGGATGCAG
AGCATGACGA GTTATGGAGA GAATGTGAAG GTAACAGCGT ATATAGATGG TGTATTGGTA
TGGGGACAGG AGCCGAGTGG AGCGACACCA ACACCGACAG CAACACCAGC ACCGACAGTG
ACACCGACAC CCACACCAGC ACCAACTCCA ACCCCGACCC CAACACCAAC TGCTACACCA
ACAGCAACGC CAACACCAAC ACCGACGCCA ACACCAACCC CAACCGCGAC ACCAACAGTA
ACAGCAACAC CAACACCGAC GCCGAGCAGC ACACCTGTAG CAGGTGGACA GATAAAGGTA
TTGTATGCTA ACAAGGAGAC AAATAGCACA ACAAATACGA TAAGGCCATG GTTGAAGGTA
GTGAACACTG GAAGCAGCAG CATAGATTTG AGCAGGGTAA CGATAAGGTA CTGGTACACG
GTAGATGGGG ACAAGGCACA GAGTGCGATA TCAGACTGGG CACAGATAGG AGCAAGCAAT
GTGACATTCA AGTTTGTGAA GCTGAGCAGT AGCGTAAGTG GAGCGGACTA TTATTTAGAG
ATAGGATTTA AGAGTGGAGC TGGGCAGTTG CAGGCTGGTA AAGACACAGG GGAGATACAG
ATAAGGTTTA ACAAGGATGA CTGGAGCAAT TACAATCAGG GGAATGACTG GTCATGGATG
CAGAGCATGA CGAGTTATGG AGAGAATGTG AAGGTAACAG CGTATATAGA TGGTGTATTG
GTATGGGGAC AGGAGCCGAG TGGAGCGACA CCAACACCGA CAGCAACACC AGCACCGACA
GTGACACCGA CACCCACACC AGCACCAACT CCAACCCCGA CCCCAACACC AACTGCTACA
CCAACAGCAA CGCCAACACC AACACCGACG CCAACACCAA CCCCAACCGC GACACCAACA
GTAACAGCAA CACCAACACC GACGCCGAGC AGCACACCGA GTGTGCTTGG CGAATATGGG
CAGAGGTTTA TGTGGTTATG GAACAAGATA CATGATCCTG CGAACGGGTA TTTTAACCAG
GATGGGATAC CATATCATTC GGTAGAGACA TTGATATGCG AAGCACCTGA TTATGGTCAT
TTGACCACGA GTGAGGCATT TTCGTACTAT GTATGGTTAG AGGCAGTGTA TGGTAAGTTA
ACGGGTGACT GGAGCAAATT TAAGACAGCA TGGGACACAT TAGAGAAGTA TATGATACCA
TCAGCGGAAG ATCAGCCGAT GAGGTCATAT GATCCTAACA AGCCAGCGAC ATACGCAGGG
GAGTGGGAGA CACCGGACAA GTATCCATCG CCGTTGGAGT TTAATGTACC TGTTGGCAAA
GACCCGTTGC ATAATGAACT TGTGAGCACA TATGGTAGCA CATTAATGTA TGGTATGCAC
TGGTTGATGG ACGTAGACAA CTGGTATGGA TATGGCAAGA GAGGGGACGG AGTAAGTCGG
GCATCATTTA TCAACACGTT CCAGAGAGGG CCTGAGGAGT CTGTATGGGA GACGGTGCCG
CATCCGAGCT GGGAGGAATT CAAGTGGGGC GGACCGAATG GATTTTTAGA TTTGTTTATT
AAGGATCAGA ACTATTCGAA GCAGTGGAGA TATACGGATG CACCAGATGC TGATGCGAGA
GCTATTCAGG CTACTTATTG GGCGAAAGTA TGGGCGAAGG AGCAAGGTAA GTTTAATGAG
ATAAGCAGCT ATGTAGCGAA GGCAGCGAAG ATGGGAGACT ATTTAAGGTA TGCGATGTTT
GACAAGTATT TCAAGCCATT AGGATGTCAG GATAAGAATG CGGCTGGAGG AACGGGGTAT
GACAGTGCAC ATTATCTGCT ATCATGGTAT TATGCATGGG GTGGAGCATT GGATGGAGCA
TGGTCATGGA AGATAGGGAG CAGCCATGTG CACTTTGGAT ATCAGAATCC GATGGCGGCA
TGGGCATTAG CGAATGATAG TGATATGAAG CCGAAGTCGC CGAATGGAGC GAGTGACTGG
GCAAAGAGTT TGAAGAGGCA GATAGAATTT TACAGGTGGT TACAGTCAGC GGAGGGAGCG
ATAGCAGGAG GCGCGACAAA TTCATGGAAT GGCAGATATG AGAAGTATCC AGCAGGGACA
GCAACATTTT ATGGAATGGC ATATGAACCG AATCCGGTAT ATCATGATCC TGGGAGCAAC
ACATGGTTTG GATTCCAGGC ATGGTCGATG CAGAGGGTAG CGGAGTATTA CTATGTGACA
GGAGATAAGG ACGCAGGAGC ACTGCTTGAG AAGTGGGTAA GCTGGGTTAA GAGTGTAGTG
AAGTTGAATA GTGATGGTAC GTTTGCGATA CCGTCGACGC TTGATTGGAG CGGACAACCT
GATACATGGA ACGGGGCGTA TACAGGGAAT AGCAACTTAC ATGTTAAGGT AGTGGACTAT
GGTACTGACT TAGGAATAAC AGCGTCATTG GCGAATGCGT TGTTGTACTA TAGTGCAGGG
ACGAAGAAGT ATGGGGTATT TGATGAGGGA GCGAAGAATT TAGCGAAGGA ATTGCTGGAC
AGGATGTGGA AGTTGTACAG GGATGAGAAG GGATTGTCAG CGCCAGAGAA GAGAGCGGAC
TACAAGAGGT TCTTTGAGCA AGAGGTATAT ATACCGGCAG GATGGATAGG GAAGATGCCG
AATGGAGATG TAATAAAGAG TGGAGTTAAG TTTATAGACA TAAGGAGCAA GTATAAACAA
GATCCTGATT GGCCGAAGTT AGAGGCGGCA TACAAGTCAG GGCAGGCACC TGAGTTCAGA
TATCACAGGT TCTGGGCACA GTGCGACATA GCAATAGCTA ATGCAACATA TGAAATACTG
TTTGGCAATC AATAA
 
Protein sequence
MRRGIALFIA IIFLCTVMYS LTFPVKAESI VTTQKYIWKN VKIEGGGGFI TGIVFNPKEK 
NLVYVRTDIG GAYRSTDGGN TWTQLMNWVS FDEWNLLGVE SIATDPVDPN RLYIAAGTYT
NSWTNMNGVL LRSKDKGNTF ERTPLPFKLG GNMPGRNMGE RLAIDPNHNN ILYLGTREGN
GLWKSIDYGV TWKKVENFPN PGTYVEDPNC PHDYLNHITG VVWVVFDPTS GKPGEGSKVI
YVGVADKNTS IYYTKDGGQT WEALPGQPTG LLPQHAKLSS DGMLYITYSN TQGPYNGDYG
EVWRYNTKTG EWKNISPMPA KDTYFGYGGL AVDAQNPKVV MVAALSSWWP DTYIWRSTDG
GETWKCIWEW AGYPNRTLHY TMDISAAPWL NFGITNPIPP ETSPKLGWMV AALEIDPFNS
DRMLYGTGAT LYGCDDLTKW DKGEKITIKV KGIGIEETSV AALVSPPIGP HLFSAIYDIA
GFRHDDLEKA PNWTYTQPNM GSTTDIDYAE LNPNFMVRVG NVDKTWNPNT NRIGFSYDGG
KSWFQGNQEP QGMTEGGTVA AAADASVVVW APKGAPVSYS TDNGNTWYQC ANVPSGAIVY
SDRVNPNKFY AFKDGKFYIS SDKGKTFVQS PASGLPTSGN FKAVPGREGD IWLVGNNGMW
HSVDGGYTFT KIGNVEEAAS IGFGKPAEGK TYPAIYTYAK INGIRGIFRS DDAGASWVRI
NDDNNQFGCA NADITGDQRV YGRVFVATNG LGIKWGEIAD SATIPTSTPA PTVTATLTPT
PTATPTPTPT PTPTPTSTAT PTPTPTPTPT STPTATPTAT PTPTPTPSST PVAGGQIKVL
YANKETNSTT NTIRPWLKVV NTGSSSIDLS RVTIRYWYTV DGDKAQSAIS DWAQIGASNV
TFKFVKLSSS VSGADYYLEI GFKSGAGQLQ AGKDTGEIQI RFNKDDWSNY NQGNDWSWMQ
SMTSYGENVK VTAYIDGVLV WGQEPSGATP TPTATPAPTV TPTPTPAPTP TPTPTPTATP
TATPTPTPTP TPTPTATPTV TATPTPTPSS TPVAGGQIKV LYANKETNST TNTIRPWLKV
VNTGSSSIDL SRVTIRYWYT VDGDKAQSAI SDWAQIGASN VTFKFVKLSS SVSGADYYLE
IGFKSGAGQL QAGKDTGEIQ IRFNKDDWSN YNQGNDWSWM QSMTSYGENV KVTAYIDGVL
VWGQEPSGAT PTPTATPAPT VTPTPTPAPT PTPTPTPTAT PTATPTPTPT PTPTPTATPT
VTATPTPTPS STPSVLGEYG QRFMWLWNKI HDPANGYFNQ DGIPYHSVET LICEAPDYGH
LTTSEAFSYY VWLEAVYGKL TGDWSKFKTA WDTLEKYMIP SAEDQPMRSY DPNKPATYAG
EWETPDKYPS PLEFNVPVGK DPLHNELVST YGSTLMYGMH WLMDVDNWYG YGKRGDGVSR
ASFINTFQRG PEESVWETVP HPSWEEFKWG GPNGFLDLFI KDQNYSKQWR YTDAPDADAR
AIQATYWAKV WAKEQGKFNE ISSYVAKAAK MGDYLRYAMF DKYFKPLGCQ DKNAAGGTGY
DSAHYLLSWY YAWGGALDGA WSWKIGSSHV HFGYQNPMAA WALANDSDMK PKSPNGASDW
AKSLKRQIEF YRWLQSAEGA IAGGATNSWN GRYEKYPAGT ATFYGMAYEP NPVYHDPGSN
TWFGFQAWSM QRVAEYYYVT GDKDAGALLE KWVSWVKSVV KLNSDGTFAI PSTLDWSGQP
DTWNGAYTGN SNLHVKVVDY GTDLGITASL ANALLYYSAG TKKYGVFDEG AKNLAKELLD
RMWKLYRDEK GLSAPEKRAD YKRFFEQEVY IPAGWIGKMP NGDVIKSGVK FIDIRSKYKQ
DPDWPKLEAA YKSGQAPEFR YHRFWAQCDI AIANATYEIL FGNQ