Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_1859 |
Symbol | |
ID | 7408972 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 1943641 |
End bp | 1947525 |
Gene Length | 3885 bp |
Protein Length | 1294 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 643716231 |
Product | glycoside hydrolase family 5 |
Protein accession | YP_002573720 |
Protein GI | 222529838 |
COG category | [R] General function prediction only |
COG ID | [COG3889] Predicted solute binding protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAGTAA AAACAAAAAT GGGGAAGAAA TGGTTGAGTA TACTATGTAC AGTTGTTTTT TTATTGAACA TTTTGTTTAT AGCAAATGTA ACGAATTTAC CCAAAGTTGG TGCGGCTACA TCTAATGATG GAGTAGTGAA GATAGATACT AGCACATTAA TAGGAACAAA TCACGCACAT TGCTGGTACA GAGATAAACT TGAGACGGCA TTGCGAGGAA TAAGGTCATG GGGTATGAAC TCTGTGAGGG TAGTGTTGAG TAATGGCTAT CGATGGACGA AGATACCAGC AAGTGAAGTA GCAAATATTA TATCATTGTC AAGAAGTCTT GGATTCAGAG CCATTGTATT AGAAGTTCAC GACACGACAG GATATGGTGA GGACGGTGCA GCATGTTCAT TGGCGCAAGC AGTAGAATAT TGGAAAGAGA TAAAGAGTGT GTTAGAAGGC AATGAGGATT TTGTTATAAT AAACATTGGT AATGAGCCGT ATGGGAACAA TAACTATCAA AACTGGATTA ATGACACGAA GAATGCTATA AAAGCGCTAA GGGATGCAGG GTTCAAGCAC ACGATAATGG TTGATGCACC GAACTGGGGG CAGGATTGGT CTAATACTAT GAGAGACAAT GCCCAGAGCA TAATGGAAGC AGATCCGCTG CGCAATTTGG TATTTTCGAT TCATATGTAC GGTGTATACA ATACAGCGAG CAAGGTAGAA GAATATATCA AGTCATTTGT GGAGAAAGGG CTGCCATTAG TTATTGGGGA GTTTGGGCAT CAGCATACAG ATGGTGACCC TGACGAGGAA GCTATTGTCA GGTATGCAAA ACAATACAAG ATAGGACTTT TTAGCTGGTC TTGGTGTGGC AATTCGAGCT ATGTAGGGTA CTTGGACATG GTAAACAATT GGGACCCCAA TAATCCAACT CCATGGGGGC AATGGTATAA AACTAATGCG ATTGGTGCCT CTTCAGTACC TACTTCAACA CCAACACCGA CACCAACTGC TACACCAACA GCAACGCCAA CACCAACACC GACGCCGAGC AGCACACCTG TAGCAGGTGG ACAGATAAAG GTATTGTATG CTAACAAGGA GACAAATAGC ACAACAAATA CGATAAGGCC ATGGTTGAAG GTAGTGAACA CTGGAAGCAG CAGCATAGAT TTGAGCAGGG TAACGATAAG GTACTGGTAC ACGGTAGATG GGGACAAGGC ACAGAGTGCG ATATCAGACT GGGCACAGAT AGGAGCAAGC AATGTGACAT TCAAGTTTGT GAAGCTGAGC AGTAGCGTAA GTGGAGCGGA CTATTATTTA GAGATAGGAT TTAAGAGTGG AGCTGGGCAG TTGCAGGCTG GTAAAGACAC AGGGGAGATA CAGATAAGGT TTAACAAGAG TGACTGGAGC AATTACAATC AGGGGAATGA CTGGTCATGG ATGCAGAGCA TGACGAGTTA TGGAGAGAAT GTGAAGGTAA CAGCGTATAT AGATGGTGTA TTGGTATGGG GACAGGAGCC GAGTGGAGCG ACACCAACAC CGACAGCAAC ACCAGCACCG ACAGTGACAC CGACAGCAAC ACCAGCACCA ACACCAACCC CGACCCCAAC ACCAACTGCT ACACCAACGC CAACACCGAC TCCAACACCA ACACCAACTG CTACCCCAAC ACCGACGCCG AGCAGTACAC CTGTAGCAGG TGGACAGATA AAGGTACTGT ATGCTAACAA GGAGACAAAT AGCACAACAA ACACGATAAG GCCATGGTTG AAGGTAGTGA ACACTGGAAG CAGCAGCATA GATTTGAGCA GGGTAACGAT AAGGTACTGG TACACGGTAG ATGGGGACAA GGCACAGAGT GCGATATCAG ACTGGGCACA GATAGGAGCA AGCAATGTGA CATTCAAGTT TGTGAAGCTG AGCAGTAGCG TAAGTGGAGC GGACTATTAT TTAGAGATAG GATTTAAGAG TGGAGCTGGG CAGTTGCAGG CTGGTAAAGA CACAGGGGAG ATACAGATAA GGTTTAACAA GAGTGACTGG AGCAATTACA ATCAGGGGAA TGACTGGTCA TGGATGCAGA GCATGACGAG TTATGGAGAG AATGTGAAGG TAACAGCGTA TATAGATGGT GTATTGGTAT GGGGACAGGA GCCGAGTGGA GCGACACCAA CACCGACAGC AACACCAGCA CCAACATCGA CATCGACGCC AACACCGACA GTAACACCAA CCCCGACCCC AACACCAACT GCTACACCAA CACCCACGGC AACGTCAATT CCATTACCAA CAGTATCACC ATCGTCGGCT GTTATTGAAA TAGCAATAAA TACAAATAAA GATAGGTCAC CAATTAGCCC GTACATTTAT GGTGCAAACC AGGATATTGG AGGTGTAGTT CATCCTGCAA GAAGGTTAGG TGGAAACAGA CTAACAGGAT ACAATTGGGA AAACAACTTT TCAAATGCGG GGAACGATTG GTATCATTCA AGTGACGATT ATTTGTGCTG GAGCATGGGA ATTTCTGGTG AAGATGCGAA GGTTCCAGCA GCAGTGGTAT CTAAATTTCA TGAGTATTCC CTTAAAAATA ATGCTTATTC TGCTATAACT TTGCAAATGG CAGGATATGT GTCAAAAGAT AATTATGGTA CTGTTAGTGA AAATGAAACA GCTCCATCTA ACAGGTGGGC AGAGGTAAAA TTTAAGAAGG ATGCTCCTTT ATCTTTGAAT CCAGACTTGA ATGATAACTT TGTTTATATG GATGAATTCA TAAATTATTT GATAAACAAA TACGGAATGG CTTCTTCACC TACCGGGATA AAAGGGTATA TACTTGATAA TGAGCCTGAT TTGTGGGTCT CAACACATCC CCGTATACAT CCTAATAAGG TCACATGCAA AGAGTTGATT GATAAATCTG TTGAACTGGC AAAAGTTATA AAAACCCTTG ATCCATCAGC TGAAGTTTTT GGATATGCAT CATATGGGTT TATGGGTTAT TATAGTCTCC AAGATGCGCC TGATTGGAAC CAAGTTAAAG GAGATCATAG ATGGTTTATA AGCTGGTATC TGGAACAGAT GAAAAAAGCA TCAGACAGTT ATGGAAAAAG ATTATTAGAT GTGCTTGATT TACACTGGTA TCCAGAAGCA CGAGGTGGAA ATATTCGCGT GTGCTTTGAT GGCGAAAATG ACACATCAAA AGAAGTTGCT ATAGCTAGGA TGCAAGCTCC AAGAACACTA TGGGACCCGA CCTACAAAAC ATCAGTGAAA GGGCAAATTA CAGCTGGTGA GAACAGCTGG ATAAACCAGT GGTTTTCAGA TTATTTGCCT ATAATTCCAA ACATAAAAGC GGACATAGAG AAATATTATC CTGGTACAAA ACTTGCTATT AGCGAATTCG ATTATGGCGG TCGAAATCAT ATTTCAGGGG GAATTGCTTT AGCTGATGTG CTCGGTATAT TTGGTAAATA TGGAGTGTAC TTTGCAGCAA GATGGGGCGA TTCTGGTAGT TATGCAGCAG CTGCATATAA CATTTATCTT AATTATGATG GAAAAGGCTC AAAATATGGC AATACAAATG TAGGTGCTAA TACAAATGAT GTTGAAAATA TGCCAGTTTA TGCTTCAATA AATGGACAGG ATGATTCTGA ACTTCATATT ATACTAATAA ACAGAAACTA TGACAGAAAA TTGCCTGCGA AGATCAGCAT TACAAGTTCA AAAAACTATA CAAAAGCAGA AATTTATGGT TTTGATAGCA ATAGTCCTAC TGTTAGAAAA ATGGGAAGTG TGGATAATAT CGAAAACAAT GTTTTAACTC TTGAGGTACC TAATTTAACA GTTTTCCATA TCGTTTTATA TTCAACCTCA GTACAAACTA AATAA
|
Protein sequence | MRVKTKMGKK WLSILCTVVF LLNILFIANV TNLPKVGAAT SNDGVVKIDT STLIGTNHAH CWYRDKLETA LRGIRSWGMN SVRVVLSNGY RWTKIPASEV ANIISLSRSL GFRAIVLEVH DTTGYGEDGA ACSLAQAVEY WKEIKSVLEG NEDFVIINIG NEPYGNNNYQ NWINDTKNAI KALRDAGFKH TIMVDAPNWG QDWSNTMRDN AQSIMEADPL RNLVFSIHMY GVYNTASKVE EYIKSFVEKG LPLVIGEFGH QHTDGDPDEE AIVRYAKQYK IGLFSWSWCG NSSYVGYLDM VNNWDPNNPT PWGQWYKTNA IGASSVPTST PTPTPTATPT ATPTPTPTPS STPVAGGQIK VLYANKETNS TTNTIRPWLK VVNTGSSSID LSRVTIRYWY TVDGDKAQSA ISDWAQIGAS NVTFKFVKLS SSVSGADYYL EIGFKSGAGQ LQAGKDTGEI QIRFNKSDWS NYNQGNDWSW MQSMTSYGEN VKVTAYIDGV LVWGQEPSGA TPTPTATPAP TVTPTATPAP TPTPTPTPTA TPTPTPTPTP TPTATPTPTP SSTPVAGGQI KVLYANKETN STTNTIRPWL KVVNTGSSSI DLSRVTIRYW YTVDGDKAQS AISDWAQIGA SNVTFKFVKL SSSVSGADYY LEIGFKSGAG QLQAGKDTGE IQIRFNKSDW SNYNQGNDWS WMQSMTSYGE NVKVTAYIDG VLVWGQEPSG ATPTPTATPA PTSTSTPTPT VTPTPTPTPT ATPTPTATSI PLPTVSPSSA VIEIAINTNK DRSPISPYIY GANQDIGGVV HPARRLGGNR LTGYNWENNF SNAGNDWYHS SDDYLCWSMG ISGEDAKVPA AVVSKFHEYS LKNNAYSAIT LQMAGYVSKD NYGTVSENET APSNRWAEVK FKKDAPLSLN PDLNDNFVYM DEFINYLINK YGMASSPTGI KGYILDNEPD LWVSTHPRIH PNKVTCKELI DKSVELAKVI KTLDPSAEVF GYASYGFMGY YSLQDAPDWN QVKGDHRWFI SWYLEQMKKA SDSYGKRLLD VLDLHWYPEA RGGNIRVCFD GENDTSKEVA IARMQAPRTL WDPTYKTSVK GQITAGENSW INQWFSDYLP IIPNIKADIE KYYPGTKLAI SEFDYGGRNH ISGGIALADV LGIFGKYGVY FAARWGDSGS YAAAAYNIYL NYDGKGSKYG NTNVGANTND VENMPVYASI NGQDDSELHI ILINRNYDRK LPAKISITSS KNYTKAEIYG FDSNSPTVRK MGSVDNIENN VLTLEVPNLT VFHIVLYSTS VQTK
|
| |