Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_1865 |
Symbol | |
ID | 7408978 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 1959955 |
End bp | 1964064 |
Gene Length | 4110 bp |
Protein Length | 1369 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 643716237 |
Product | glycoside hydrolase family 9 |
Protein accession | YP_002573726 |
Protein GI | 222529844 |
COG category | [R] General function prediction only |
COG ID | [COG3889] Predicted solute binding protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.56874 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTTAAAC TAAAAAGAGC AATAAAAATG ATTACATTTT GTGTTGCTAT GGTATTTCTA TTGCAGGTTT TCTTTCTATT TTCAGGATAT AATAACAGTG AAGTAAAAGC AGCAACAACC TTTAACTATG GTGAAGCTCT TCAAAAAGCG ATCATGTTTT ATGAATTTCA GATGTCAGGT AAACTACCAT CATGGATCCG TAACAACTGG CGCGGGGATT CTGGTCTAAA TGATGGCAAA GATGTAGGTT TAGATCTTAC TGGTGGCTGG CATGATGCGG GCGACCATGT AAAGTTTAAT CTACCAATGT CATACAGTGC ATCAATGCTT TCGTGGGCAG TTTATGAGTA CAAAGCAGCA TTTGAGAAAA GTGGTCAGCT TGAACATATA CTTAACCAGA TTGAATGGGT AAACGACTAC TTTGTAAAAT GCCATCCATC AAAGTATGTA TACTACTATC AAGTTGGTGA CCCAATTGAA GATCATAACT TCTGGGGTCC AGCAGAAGTT ATGCAAATGA AACGACCAGC ATACAAGTGT GACTTAAATA ATCCAGCAAG TTCGGTTGTT GCAGAAACAG CAGCATCCTT AGCTGCAGCT TCAATCGTCA TACGTGAAAG AAATAGTCAA AAGGCAGACA CATATTTGCA GCATGCGATG GTACTCTTTG ATTTTGCCGA TAGAACTCGT AGTGATGCAG GGTATACCGC AGCAACAGGC TTTTACACAT CAGGTGGTTT TATTGATGAT CTTGGTTGGG CAGCAGTGTG GTTATATCTT GCGACAAATG ACAAATCATA TTTAGATAAA GCTGAGGCAC TTATGGCAGA ATATGCCGGT GGCACAAATA CATGGACACA GTGCTGGGAC GATGTAAGAT ACGGAGCAAT ATTGCTTTTA GCAAAAATTA CTAATAAAGA CATATATAAA GGTGCTGTTG AAAGAAATCT TGATCATTGG ACATATAACA TAACCTATAC ACCTAAAGGT CTTGCATGGA TAACAGGGTG GGGCTCACTT AGGTATGCCA CAACTGCAGC TTTCTTAGCG TTTGTTTATG CAGATTGGTC AGGATGTCCA GAAAATAAGC GAACAGCTTA TCTAAAATTT GGTGAGAGTC AGATTAACTA TGCATTAGGT TCAACAGGAA GAAGCTTTTT GGTAGGATTT GGGCAAAATT ATCCACAACA TCCACATCAC AGAAATGCAC ACAGTTCATG GGCGAACAGT ATGCGAATAC CTGAATATCA TCGACACATA CTTTATGGTG CATTAGTAGG CGGACCAGGC TCTGATGATA GTTACAATGA TGATATTACT GACTATGTTC AAAACGAGGT GGCTTGTGAC TACAATGCTG GTATTGTAGG TGCTCTGGCA AAAATGTACC TTATGTATGG AGGAGACCCA ATACCTAATT TCAAAGCTAT CGAAAAGCCA ACTAATGATG AAATTTTTGT TGAATCCAAG TTTGGTAATT CACAGGGTAC AAACTATACC GAAATAATTT CATACATTTA TAACAGAACG GGATGGCCGC CTCGAGTCAC AGATAATCTA AACTTTAAGT ATTTTATTGA CCTAAGTGAG TTAATCAAGG CTGGGTATGG TCCTGATGTT GTTAAAGTAG AGACATATTA TTCAGAAGGT GGAAAAATAT CTGGACCATA CGTATGGAAT GCATCAAAGA ACCTTTACTA TATATTAGTT GATTTTACAG GAACAAAAAT ATATCCAGGT GGGGAAGTAG AACACAAAAA ACAAGCTCAA TTTAAGATAT CTGTGCCACA AGGTGTTCCA TGGGATCCAA CTAATGACCC ATCTTATGCA GGATTAACAA AAGAACTTAG TAAAAATAAG TTCATAGCAG CTTATGAAGG TAACGTGCTG GTATGGGGAC AAGAACCAGA GGGTTCGTCA AGTTCAACCC CAACCCCAAC ACCAACACCA ACACCAACAC TGACTCCAAC ACCGACATCA ACTGCTACAC CAACACCGAC ACCTACACCA ACACCAACGT CAACACCAAC TGCTACACCA ACAGCAACGC CAACACCAAC ACCGACGCCG AGCAGCACAC CTGTAGCAGG CGGGCAGATA AAGGTATTGT ATGCTAACAA GGAGACAAAT AGCACAACAA ACACGATAAG GCCATGGTTG AAGGTAGTGA ACACTGGAAG CAGCAGCATA GATTTAAGCA GGGTAACGAT AAGGTACTGG TACACGGTAG ATGGGGACAA GGCACAGAGT GCGATATCAG ACTGGGCACA GATAGGAGCA AGCAATGTGA CATTCAAGTT TGTGAAGCTG AGCAGTAGCG TAAGTGGAGC GGACTATTAT TTAGAGATAG GATTTAAGAG TGGAGCTGGG CAGTTGCAGG CTGGTAAAGA CACAGGGGAG ATACAGATAA GGTTTAACAA GAGTGACTGG AGCAATTACA ATCAGGGGAA TGACTGGTCA TGGATGCAGA GCATGACGAG TTATGGAGAG AATGTGAAGG TAACAGCGTA TATAGATGGT GTATTGGTAT GGGGACAGGA GCCGAGTGGA GCGACACCAA CACCGACAGC AACACCAGCA CCGACAGTGA CACCGACACC AACACCAGCA CCAACACCAA CCCCGACTCC AACACCAACT GCTACACCAA CGCCAACACC GACTCCAACA CCAACACCAA CTGCTACCCC AACACCGACG CCGAGCAGCA CACCTGTAGC AGGTGGACAG ATAAAGGTAT TGTATGCTAA CAAGGAGACA AATAGCACAA CAAACACGAT AAGGCCATGG TTGAAGGTAG TGAACACTGG AAGCAGCAGC ATAGATTTAA GCAGGGTAAC GATAAGGTAC TGGTACACGG TAGATGGGGA CAAGGCACAG AGTGCGATAT CAGACTGGGC ACAGATAGGA GCAAGCAATG TGACATTCAA GTTTGTGAAG CTGAGCAGTA GCGTAAGTGG AGCGGACTAT TATTTAGAGA TAGGATTTAA GAGTGGAGCT GGGCAGTTGC AGGCTGGTAA AGACACAGGG GAGATACAGA TAAGGTTTAA CAAGAGTGAC TGGAGCAATT ACAATCAGGG GAATGACTGG TCATGGATGC AGAGCATGAC GAGTTATGGA GAGAATGTGA AGGTAACAGC GTATATAGAT GGTGTATTGG TATGGGGACA GGAGCCGAGT GGAGCGACAC CAACACCGAC AGCAACACCA GCACCGACAG TGACACCTAC ACCTACACCA ACTCCAACTC CAACGCCGAG CAGTGGAATA GTGAAGATAG ATACTAGCAC ATTAATAGGA ACAAATCACG CACATTGCTG GTACAGAGAT AAACTTGAGA CGGCATTGCG AGGAATAAGG TCATGGGGTA TGAACTCTGT GAGGGTAGTG TTGAGTAATG GCTATCGATG GACGAAGATA CCAGCAAGTG AAGTAGCAAA TATTATATCA TTGTCAAGAA GTCTTGGATT CAGAGCCATT GTATTAGAAG TTCACGACAC GACAGGATAT GGTGAGGACG GTGCAGCATG TTCATTGGCG CAAGCAGTAG AATATTGGAA AGAGATAAAG AGTGTGTTAG AAGGCAATGA GGATTTTGTT ATAATAAACA TTGGTAATGA GCCGTATGGG AACAATAACT ATCAAAACTG GATTAATGAC ACGAAGAATG CTATAAAAGC GCTAAGGGAT GCAGGGTTCA AGCACACGAT AATGGTTGAT GCACCGAACT GGGGGCAGGA TTGGTCTAAT ACTATGAGAG ACAATGCCCA GAGCATAATG GAAGCAGATC CGCTGCGCAA TTTGGTATTT TCGATTCATA TGTACGGTGT ATACAATACA GCGAGCAAGG TAGAAGAATA TATCAAGTCA TTTGTGGAGA AAGGGCTGCC ATTAGTTATT GGGGAGTTTG GGCATCAGCA TACAGATGGT GACCCTGACG AGGAAGCTAT TGTCAGGTAT GCAAAACAAT ACAAGATAGG ACTTTTTAGC TGGTCTTGGT GTGGCAATTC GAGCTATGTA GGGTACTTGG ACATGGTAAA CAATTGGGAC CCCAATAATC CAACTCCATG GGGGCAATGG TATAAAACTA ATGCGATTGG TGCTGAATAA
|
Protein sequence | MLKLKRAIKM ITFCVAMVFL LQVFFLFSGY NNSEVKAATT FNYGEALQKA IMFYEFQMSG KLPSWIRNNW RGDSGLNDGK DVGLDLTGGW HDAGDHVKFN LPMSYSASML SWAVYEYKAA FEKSGQLEHI LNQIEWVNDY FVKCHPSKYV YYYQVGDPIE DHNFWGPAEV MQMKRPAYKC DLNNPASSVV AETAASLAAA SIVIRERNSQ KADTYLQHAM VLFDFADRTR SDAGYTAATG FYTSGGFIDD LGWAAVWLYL ATNDKSYLDK AEALMAEYAG GTNTWTQCWD DVRYGAILLL AKITNKDIYK GAVERNLDHW TYNITYTPKG LAWITGWGSL RYATTAAFLA FVYADWSGCP ENKRTAYLKF GESQINYALG STGRSFLVGF GQNYPQHPHH RNAHSSWANS MRIPEYHRHI LYGALVGGPG SDDSYNDDIT DYVQNEVACD YNAGIVGALA KMYLMYGGDP IPNFKAIEKP TNDEIFVESK FGNSQGTNYT EIISYIYNRT GWPPRVTDNL NFKYFIDLSE LIKAGYGPDV VKVETYYSEG GKISGPYVWN ASKNLYYILV DFTGTKIYPG GEVEHKKQAQ FKISVPQGVP WDPTNDPSYA GLTKELSKNK FIAAYEGNVL VWGQEPEGSS SSTPTPTPTP TPTLTPTPTS TATPTPTPTP TPTSTPTATP TATPTPTPTP SSTPVAGGQI KVLYANKETN STTNTIRPWL KVVNTGSSSI DLSRVTIRYW YTVDGDKAQS AISDWAQIGA SNVTFKFVKL SSSVSGADYY LEIGFKSGAG QLQAGKDTGE IQIRFNKSDW SNYNQGNDWS WMQSMTSYGE NVKVTAYIDG VLVWGQEPSG ATPTPTATPA PTVTPTPTPA PTPTPTPTPT ATPTPTPTPT PTPTATPTPT PSSTPVAGGQ IKVLYANKET NSTTNTIRPW LKVVNTGSSS IDLSRVTIRY WYTVDGDKAQ SAISDWAQIG ASNVTFKFVK LSSSVSGADY YLEIGFKSGA GQLQAGKDTG EIQIRFNKSD WSNYNQGNDW SWMQSMTSYG ENVKVTAYID GVLVWGQEPS GATPTPTATP APTVTPTPTP TPTPTPSSGI VKIDTSTLIG TNHAHCWYRD KLETALRGIR SWGMNSVRVV LSNGYRWTKI PASEVANIIS LSRSLGFRAI VLEVHDTTGY GEDGAACSLA QAVEYWKEIK SVLEGNEDFV IINIGNEPYG NNNYQNWIND TKNAIKALRD AGFKHTIMVD APNWGQDWSN TMRDNAQSIM EADPLRNLVF SIHMYGVYNT ASKVEEYIKS FVEKGLPLVI GEFGHQHTDG DPDEEAIVRY AKQYKIGLFS WSWCGNSSYV GYLDMVNNWD PNNPTPWGQW YKTNAIGAE
|
| |