Gene Information |
Locus tag | Athe_1860 |
Symbol | |
ID | 7408973 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 1947681 |
End bp | 1953395 |
Gene Length | 5715 bp |
Protein Length | 1904 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 643716232 |
Product | glycoside hydrolase family 48 |
Protein accession | YP_002573721 |
Protein GI | 222529839 |
COG category | [R] General function prediction only |
COG ID | [COG3889] Predicted solute binding protein |
TIGRFAM ID | |
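The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon is excluded."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record for Athe_1860
length = gene_length_bp(1947681, 1953395)
print(length)                     # 5715
print(protein_length_aa(length))  # 1904
```

Both results agree with the Gene Length (5715 bp) and Protein Length (1904 aa) fields listed in the record.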
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.738055 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGAAGGG GTATAGCGTT ATTTATTGCA ATTATCTTTT TATGTACAGT AATGTATAGC TTAACTTTTC CAGTGAAGGC AGAGAGTATA GTCACCACTC AAAAATACAT TTGGAAAAAT GTAAAAATTG AAGGTGGCGG CGGTTTTATT ACGGGAATAG TTTTTAATCC TAAAGAAAAA AACTTAGTTT ATGTAAGAAC AGATATTGGC GGTGCTTACA GAAGTACAGA TGGTGGCAAT ACGTGGACAC AGCTTATGAA TTGGGTTAGT TTTGATGAGT GGAATTTATT AGGCGTAGAA AGTATAGCAA CAGATCCTGT TGATCCAAAT CGACTCTACA TTGCTGCTGG TACTTACACT AATAGTTGGA CAAATATGAA TGGGGTCCTT TTACGTTCAA AAGATAAAGG TAATACATTT GAACGTACTC CTTTACCATT CAAATTGGGC GGTAATATGC CTGGTCGAAA TATGGGAGAG AGATTAGCTA TTGATCCGAA TCACAATAAT ATCTTATATT TAGGTACACG CGAAGGAAAT GGACTATGGA AAAGTATAGA TTATGGCGTA ACGTGGAAAA AAGTAGAGAA CTTCCCGAAC CCAGGAACAT ATGTTGAAGA CCCGAATTGT CCTCATGATT ATTTGAATCA TATTACAGGT GTAGTATGGG TTGTATTTGA TCCAACGAGC GGTAAACCCG GAGAAGGAAG CAAAGTGATT TATGTAGGGG TAGCTGATAA AAATACAAGC ATATACTATA CAAAAGATGG TGGGCAAACA TGGGAGGCAC TACCAGGACA ACCTACAGGT TTACTTCCAC AACATGCAAA GTTAAGCTCA GATGGGATGC TGTATATAAC ATATAGCAAT ACTCAGGGCC CTTACAACGG TGACTATGGA GAAGTATGGA GATATAATAC TAAAACTGGC GAATGGAAGA ACATAAGTCC AATGCCAGCA AAAGATACTT ATTTTGGATA TGGAGGATTA GCTGTTGATG CTCAAAATCC AAAGGTGGTA ATGGTAGCAG CTTTGAGTTC ATGGTGGCCT GATACTTATA TTTGGCGTAG TACAGATGGC GGTGAAACAT GGAAGTGTAT ATGGGAATGG GCTGGATATC CTAACAGAAC GCTTCACTAT ACTATGGATA TTTCTGCAGC TCCATGGTTG AATTTTGGTA TTACGAATCC AATTCCTCCC GAGACAAGTC CAAAGCTTGG TTGGATGGTT GCAGCCCTTG AGATTGATCC ATTTAACTCT GATAGAATGT TATATGGGAC AGGTGCTACG TTATATGGGT GTGATGATTT AACTAAATGG GATAAAGGTG AAAAGATAAC AATTAAAGTA AAAGGAATTG GAATAGAAGA AACATCAGTT GCAGCATTAG TAAGCCCACC AATCGGACCA CATTTATTTA GTGCAATATA CGATATTGCT GGGTTTAGAC ATGATGATTT GGAGAAAGCA CCCAACTGGA CATATACTCA ACCCAATATG GGTTCGACTA CTGATATAGA TTACGCTGAA TTGAATCCAA ACTTTATGGT GCGTGTAGGG AATGTTGACA AAACATGGAA TCCAAACACA AATAGAATTG GTTTTTCGTA TGATGGTGGG AAATCATGGT TCCAAGGTAA TCAAGAACCT CAAGGAATGA CAGAAGGTGG AACAGTTGCT GCAGCTGCTG ATGCTAGTGT AGTTGTGTGG GCACCAAAAG GAGCTCCTGT ATCCTATTCC ACAGACAATG GTAATACATG GTATCAATGT GCCAATGTAC CTTCCGGAGC GATTGTATAC 
TCTGATAGAG TTAATCCAAA CAAGTTTTAT GCTTTTAAAG ATGGAAAATT CTACATTAGT AGTGACAAAG GAAAAACCTT TGTTCAATCA CCAGCTTCTG GCTTGCCAAC AAGCGGGAAC TTTAAAGCTG TTCCTGGTCG TGAAGGTGAT ATATGGTTAG TCGGCAATAA TGGAATGTGG CATTCGGTTG ATGGTGGTTA TACGTTTACA AAAATTGGAA ATGTAGAAGA AGCAGCAAGT ATTGGGTTTG GTAAACCTGC AGAGGGAAAA ACATATCCGG CAATCTACAC TTATGCAAAG ATAAATGGAA TCAGAGGAAT ATTTAGATCT GATGATGCAG GTGCAAGTTG GGTAAGAATA AATGATGATA ATAATCAATT TGGTTGTGCT AATGCAGACA TTACTGGAGA CCAAAGAGTA TATGGAAGAG TATTCGTTGC TACTAATGGT TTAGGAATAA AATGGGGTGA AATTGCTGAT AGTGCTACAA TTCCAACATC AACGCCAGCA CCAACAGTAA CTGCTACACT GACCCCAACA CCAACTGCTA CACCAACGCC AACACCAACA CCAACACCAA CACCAACATC AACTGCTACA CCAACACCGA CACCTACACC AACACCAACG TCAACACCAA CTGCTACACC AACAGCAACG CCAACACCAA CACCGACGCC GAGCAGCACA CCTGTAGCAG GTGGACAGAT AAAGGTATTG TATGCTAACA AGGAGACAAA TAGCACAACA AATACGATAA GGCCATGGTT GAAGGTAGTG AACACTGGAA GTAGCAGCAT AGATTTGAGC AGGGTAACGA TAAGGTACTG GTACACGGTA GATGGGGACA AGGCACAGAG TGCGATATCA GACTGGGCAC AGATAGGAGC AAGCAATGTG ACATTCAAGT TTGTGAAGCT GAGCAGTAGC GTAAGTGGAG CGGACTATTA TTTAGAGATA GGATTTAAGA GTGGAGCTGG GCAGTTGCAG GCTGGTAAAG ACACAGGGGA GATACAGATA AGGTTTAACA AGGATGACTG GAGCAATTAC AATCAGGGGA ATGACTGGTC ATGGATGCAG AGCATGACGA GTTATGGAGA GAATGTGAAG GTAACAGCGT ATATAGATGG TGTATTGGTA TGGGGACAGG AGCCGAGTGG AGCGACACCA ACACCGACAG CAACACCAGC ACCGACAGTG ACACCGACAC CCACACCAGC ACCAACTCCA ACCCCGACCC CAACACCAAC TGCTACACCA ACAGCAACGC CAACACCAAC ACCGACGCCA ACACCAACCC CAACCGCGAC ACCAACAGTA ACAGCAACAC CAACACCGAC GCCGAGCAGC ACACCTGTAG CAGGTGGACA GATAAAGGTA TTGTATGCTA ACAAGGAGAC AAATAGCACA ACAAATACGA TAAGGCCATG GTTGAAGGTA GTGAACACTG GAAGCAGCAG CATAGATTTG AGCAGGGTAA CGATAAGGTA CTGGTACACG GTAGATGGGG ACAAGGCACA GAGTGCGATA TCAGACTGGG CACAGATAGG AGCAAGCAAT GTGACATTCA AGTTTGTGAA GCTGAGCAGT AGCGTAAGTG GAGCGGACTA TTATTTAGAG ATAGGATTTA AGAGTGGAGC TGGGCAGTTG CAGGCTGGTA AAGACACAGG GGAGATACAG ATAAGGTTTA ACAAGGATGA CTGGAGCAAT TACAATCAGG GGAATGACTG GTCATGGATG CAGAGCATGA CGAGTTATGG AGAGAATGTG AAGGTAACAG CGTATATAGA TGGTGTATTG GTATGGGGAC 
AGGAGCCGAG TGGAGCGACA CCAACACCGA CAGCAACACC AGCACCGACA GTGACACCGA CACCCACACC AGCACCAACT CCAACCCCGA CCCCAACACC AACTGCTACA CCAACAGCAA CGCCAACACC AACACCGACG CCAACACCAA CCCCAACCGC GACACCAACA GTAACAGCAA CACCAACACC GACGCCGAGC AGCACACCGA GTGTGCTTGG CGAATATGGG CAGAGGTTTA TGTGGTTATG GAACAAGATA CATGATCCTG CGAACGGGTA TTTTAACCAG GATGGGATAC CATATCATTC GGTAGAGACA TTGATATGCG AAGCACCTGA TTATGGTCAT TTGACCACGA GTGAGGCATT TTCGTACTAT GTATGGTTAG AGGCAGTGTA TGGTAAGTTA ACGGGTGACT GGAGCAAATT TAAGACAGCA TGGGACACAT TAGAGAAGTA TATGATACCA TCAGCGGAAG ATCAGCCGAT GAGGTCATAT GATCCTAACA AGCCAGCGAC ATACGCAGGG GAGTGGGAGA CACCGGACAA GTATCCATCG CCGTTGGAGT TTAATGTACC TGTTGGCAAA GACCCGTTGC ATAATGAACT TGTGAGCACA TATGGTAGCA CATTAATGTA TGGTATGCAC TGGTTGATGG ACGTAGACAA CTGGTATGGA TATGGCAAGA GAGGGGACGG AGTAAGTCGG GCATCATTTA TCAACACGTT CCAGAGAGGG CCTGAGGAGT CTGTATGGGA GACGGTGCCG CATCCGAGCT GGGAGGAATT CAAGTGGGGC GGACCGAATG GATTTTTAGA TTTGTTTATT AAGGATCAGA ACTATTCGAA GCAGTGGAGA TATACGGATG CACCAGATGC TGATGCGAGA GCTATTCAGG CTACTTATTG GGCGAAAGTA TGGGCGAAGG AGCAAGGTAA GTTTAATGAG ATAAGCAGCT ATGTAGCGAA GGCAGCGAAG ATGGGAGACT ATTTAAGGTA TGCGATGTTT GACAAGTATT TCAAGCCATT AGGATGTCAG GATAAGAATG CGGCTGGAGG AACGGGGTAT GACAGTGCAC ATTATCTGCT ATCATGGTAT TATGCATGGG GTGGAGCATT GGATGGAGCA TGGTCATGGA AGATAGGGAG CAGCCATGTG CACTTTGGAT ATCAGAATCC GATGGCGGCA TGGGCATTAG CGAATGATAG TGATATGAAG CCGAAGTCGC CGAATGGAGC GAGTGACTGG GCAAAGAGTT TGAAGAGGCA GATAGAATTT TACAGGTGGT TACAGTCAGC GGAGGGAGCG ATAGCAGGAG GCGCGACAAA TTCATGGAAT GGCAGATATG AGAAGTATCC AGCAGGGACA GCAACATTTT ATGGAATGGC ATATGAACCG AATCCGGTAT ATCATGATCC TGGGAGCAAC ACATGGTTTG GATTCCAGGC ATGGTCGATG CAGAGGGTAG CGGAGTATTA CTATGTGACA GGAGATAAGG ACGCAGGAGC ACTGCTTGAG AAGTGGGTAA GCTGGGTTAA GAGTGTAGTG AAGTTGAATA GTGATGGTAC GTTTGCGATA CCGTCGACGC TTGATTGGAG CGGACAACCT GATACATGGA ACGGGGCGTA TACAGGGAAT AGCAACTTAC ATGTTAAGGT AGTGGACTAT GGTACTGACT TAGGAATAAC AGCGTCATTG GCGAATGCGT TGTTGTACTA TAGTGCAGGG ACGAAGAAGT ATGGGGTATT TGATGAGGGA GCGAAGAATT TAGCGAAGGA ATTGCTGGAC AGGATGTGGA AGTTGTACAG 
GGATGAGAAG GGATTGTCAG CGCCAGAGAA GAGAGCGGAC TACAAGAGGT TCTTTGAGCA AGAGGTATAT ATACCGGCAG GATGGATAGG GAAGATGCCG AATGGAGATG TAATAAAGAG TGGAGTTAAG TTTATAGACA TAAGGAGCAA GTATAAACAA GATCCTGATT GGCCGAAGTT AGAGGCGGCA TACAAGTCAG GGCAGGCACC TGAGTTCAGA TATCACAGGT TCTGGGCACA GTGCGACATA GCAATAGCTA ATGCAACATA TGAAATACTG TTTGGCAATC AATAA
|
Protein sequence | MRRGIALFIA IIFLCTVMYS LTFPVKAESI VTTQKYIWKN VKIEGGGGFI TGIVFNPKEK NLVYVRTDIG GAYRSTDGGN TWTQLMNWVS FDEWNLLGVE SIATDPVDPN RLYIAAGTYT NSWTNMNGVL LRSKDKGNTF ERTPLPFKLG GNMPGRNMGE RLAIDPNHNN ILYLGTREGN GLWKSIDYGV TWKKVENFPN PGTYVEDPNC PHDYLNHITG VVWVVFDPTS GKPGEGSKVI YVGVADKNTS IYYTKDGGQT WEALPGQPTG LLPQHAKLSS DGMLYITYSN TQGPYNGDYG EVWRYNTKTG EWKNISPMPA KDTYFGYGGL AVDAQNPKVV MVAALSSWWP DTYIWRSTDG GETWKCIWEW AGYPNRTLHY TMDISAAPWL NFGITNPIPP ETSPKLGWMV AALEIDPFNS DRMLYGTGAT LYGCDDLTKW DKGEKITIKV KGIGIEETSV AALVSPPIGP HLFSAIYDIA GFRHDDLEKA PNWTYTQPNM GSTTDIDYAE LNPNFMVRVG NVDKTWNPNT NRIGFSYDGG KSWFQGNQEP QGMTEGGTVA AAADASVVVW APKGAPVSYS TDNGNTWYQC ANVPSGAIVY SDRVNPNKFY AFKDGKFYIS SDKGKTFVQS PASGLPTSGN FKAVPGREGD IWLVGNNGMW HSVDGGYTFT KIGNVEEAAS IGFGKPAEGK TYPAIYTYAK INGIRGIFRS DDAGASWVRI NDDNNQFGCA NADITGDQRV YGRVFVATNG LGIKWGEIAD SATIPTSTPA PTVTATLTPT PTATPTPTPT PTPTPTSTAT PTPTPTPTPT STPTATPTAT PTPTPTPSST PVAGGQIKVL YANKETNSTT NTIRPWLKVV NTGSSSIDLS RVTIRYWYTV DGDKAQSAIS DWAQIGASNV TFKFVKLSSS VSGADYYLEI GFKSGAGQLQ AGKDTGEIQI RFNKDDWSNY NQGNDWSWMQ SMTSYGENVK VTAYIDGVLV WGQEPSGATP TPTATPAPTV TPTPTPAPTP TPTPTPTATP TATPTPTPTP TPTPTATPTV TATPTPTPSS TPVAGGQIKV LYANKETNST TNTIRPWLKV VNTGSSSIDL SRVTIRYWYT VDGDKAQSAI SDWAQIGASN VTFKFVKLSS SVSGADYYLE IGFKSGAGQL QAGKDTGEIQ IRFNKDDWSN YNQGNDWSWM QSMTSYGENV KVTAYIDGVL VWGQEPSGAT PTPTATPAPT VTPTPTPAPT PTPTPTPTAT PTATPTPTPT PTPTPTATPT VTATPTPTPS STPSVLGEYG QRFMWLWNKI HDPANGYFNQ DGIPYHSVET LICEAPDYGH LTTSEAFSYY VWLEAVYGKL TGDWSKFKTA WDTLEKYMIP SAEDQPMRSY DPNKPATYAG EWETPDKYPS PLEFNVPVGK DPLHNELVST YGSTLMYGMH WLMDVDNWYG YGKRGDGVSR ASFINTFQRG PEESVWETVP HPSWEEFKWG GPNGFLDLFI KDQNYSKQWR YTDAPDADAR AIQATYWAKV WAKEQGKFNE ISSYVAKAAK MGDYLRYAMF DKYFKPLGCQ DKNAAGGTGY DSAHYLLSWY YAWGGALDGA WSWKIGSSHV HFGYQNPMAA WALANDSDMK PKSPNGASDW AKSLKRQIEF YRWLQSAEGA IAGGATNSWN GRYEKYPAGT ATFYGMAYEP NPVYHDPGSN TWFGFQAWSM QRVAEYYYVT GDKDAGALLE KWVSWVKSVV KLNSDGTFAI PSTLDWSGQP DTWNGAYTGN SNLHVKVVDY GTDLGITASL ANALLYYSAG TKKYGVFDEG AKNLAKELLD 
RMWKLYRDEK GLSAPEKRAD YKRFFEQEVY IPAGWIGKMP NGDVIKSGVK FIDIRSKYKQ DPDWPKLEAA YKSGQAPEFR YHRFWAQCDI AIANATYEIL FGNQ
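The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the gene sequence is listed in coding orientation (it begins with ATG and ends with TAA even though the strand is "-") and relies on the fact that translation table 11 assigns codons to amino acids in the same way as the standard code, differing only in which start codons are permitted. The names `CODON_TABLE` and `translate` are hypothetical, and alternative start codons and ambiguous bases are not handled.

```python
from itertools import product

# Codon-to-amino-acid assignments shared by the standard code and table 11,
# listed in TCAG order for the first, second, and third base.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    # The record prints sequences in space-separated blocks of 10, so strip whitespace.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # raises KeyError on non-ACGT characters
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence in the record
print(translate("ATGCGAAGGG GTATAGCGTT ATTTATTGCA"))  # MRRGIALFIA
print(translate("TTTGGCAATC AATAA"))                  # FGNQ
```

The two outputs match the first ten and last four residues of the protein sequence listed above.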