Gene Athe_1859 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_1859 
Symbol 
ID7408972 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp1943641 
End bp1947525 
Gene Length3885 bp 
Protein Length1294 aa 
Translation table11 
GC content42% 
IMG OID643716231 
Productglycoside hydrolase family 5 
Protein accessionYP_002573720 
Protein GI222529838 
COG category[R] General function prediction only 
COG ID[COG3889] Predicted solute binding protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAGTAA AAACAAAAAT GGGGAAGAAA TGGTTGAGTA TACTATGTAC AGTTGTTTTT 
TTATTGAACA TTTTGTTTAT AGCAAATGTA ACGAATTTAC CCAAAGTTGG TGCGGCTACA
TCTAATGATG GAGTAGTGAA GATAGATACT AGCACATTAA TAGGAACAAA TCACGCACAT
TGCTGGTACA GAGATAAACT TGAGACGGCA TTGCGAGGAA TAAGGTCATG GGGTATGAAC
TCTGTGAGGG TAGTGTTGAG TAATGGCTAT CGATGGACGA AGATACCAGC AAGTGAAGTA
GCAAATATTA TATCATTGTC AAGAAGTCTT GGATTCAGAG CCATTGTATT AGAAGTTCAC
GACACGACAG GATATGGTGA GGACGGTGCA GCATGTTCAT TGGCGCAAGC AGTAGAATAT
TGGAAAGAGA TAAAGAGTGT GTTAGAAGGC AATGAGGATT TTGTTATAAT AAACATTGGT
AATGAGCCGT ATGGGAACAA TAACTATCAA AACTGGATTA ATGACACGAA GAATGCTATA
AAAGCGCTAA GGGATGCAGG GTTCAAGCAC ACGATAATGG TTGATGCACC GAACTGGGGG
CAGGATTGGT CTAATACTAT GAGAGACAAT GCCCAGAGCA TAATGGAAGC AGATCCGCTG
CGCAATTTGG TATTTTCGAT TCATATGTAC GGTGTATACA ATACAGCGAG CAAGGTAGAA
GAATATATCA AGTCATTTGT GGAGAAAGGG CTGCCATTAG TTATTGGGGA GTTTGGGCAT
CAGCATACAG ATGGTGACCC TGACGAGGAA GCTATTGTCA GGTATGCAAA ACAATACAAG
ATAGGACTTT TTAGCTGGTC TTGGTGTGGC AATTCGAGCT ATGTAGGGTA CTTGGACATG
GTAAACAATT GGGACCCCAA TAATCCAACT CCATGGGGGC AATGGTATAA AACTAATGCG
ATTGGTGCCT CTTCAGTACC TACTTCAACA CCAACACCGA CACCAACTGC TACACCAACA
GCAACGCCAA CACCAACACC GACGCCGAGC AGCACACCTG TAGCAGGTGG ACAGATAAAG
GTATTGTATG CTAACAAGGA GACAAATAGC ACAACAAATA CGATAAGGCC ATGGTTGAAG
GTAGTGAACA CTGGAAGCAG CAGCATAGAT TTGAGCAGGG TAACGATAAG GTACTGGTAC
ACGGTAGATG GGGACAAGGC ACAGAGTGCG ATATCAGACT GGGCACAGAT AGGAGCAAGC
AATGTGACAT TCAAGTTTGT GAAGCTGAGC AGTAGCGTAA GTGGAGCGGA CTATTATTTA
GAGATAGGAT TTAAGAGTGG AGCTGGGCAG TTGCAGGCTG GTAAAGACAC AGGGGAGATA
CAGATAAGGT TTAACAAGAG TGACTGGAGC AATTACAATC AGGGGAATGA CTGGTCATGG
ATGCAGAGCA TGACGAGTTA TGGAGAGAAT GTGAAGGTAA CAGCGTATAT AGATGGTGTA
TTGGTATGGG GACAGGAGCC GAGTGGAGCG ACACCAACAC CGACAGCAAC ACCAGCACCG
ACAGTGACAC CGACAGCAAC ACCAGCACCA ACACCAACCC CGACCCCAAC ACCAACTGCT
ACACCAACGC CAACACCGAC TCCAACACCA ACACCAACTG CTACCCCAAC ACCGACGCCG
AGCAGTACAC CTGTAGCAGG TGGACAGATA AAGGTACTGT ATGCTAACAA GGAGACAAAT
AGCACAACAA ACACGATAAG GCCATGGTTG AAGGTAGTGA ACACTGGAAG CAGCAGCATA
GATTTGAGCA GGGTAACGAT AAGGTACTGG TACACGGTAG ATGGGGACAA GGCACAGAGT
GCGATATCAG ACTGGGCACA GATAGGAGCA AGCAATGTGA CATTCAAGTT TGTGAAGCTG
AGCAGTAGCG TAAGTGGAGC GGACTATTAT TTAGAGATAG GATTTAAGAG TGGAGCTGGG
CAGTTGCAGG CTGGTAAAGA CACAGGGGAG ATACAGATAA GGTTTAACAA GAGTGACTGG
AGCAATTACA ATCAGGGGAA TGACTGGTCA TGGATGCAGA GCATGACGAG TTATGGAGAG
AATGTGAAGG TAACAGCGTA TATAGATGGT GTATTGGTAT GGGGACAGGA GCCGAGTGGA
GCGACACCAA CACCGACAGC AACACCAGCA CCAACATCGA CATCGACGCC AACACCGACA
GTAACACCAA CCCCGACCCC AACACCAACT GCTACACCAA CACCCACGGC AACGTCAATT
CCATTACCAA CAGTATCACC ATCGTCGGCT GTTATTGAAA TAGCAATAAA TACAAATAAA
GATAGGTCAC CAATTAGCCC GTACATTTAT GGTGCAAACC AGGATATTGG AGGTGTAGTT
CATCCTGCAA GAAGGTTAGG TGGAAACAGA CTAACAGGAT ACAATTGGGA AAACAACTTT
TCAAATGCGG GGAACGATTG GTATCATTCA AGTGACGATT ATTTGTGCTG GAGCATGGGA
ATTTCTGGTG AAGATGCGAA GGTTCCAGCA GCAGTGGTAT CTAAATTTCA TGAGTATTCC
CTTAAAAATA ATGCTTATTC TGCTATAACT TTGCAAATGG CAGGATATGT GTCAAAAGAT
AATTATGGTA CTGTTAGTGA AAATGAAACA GCTCCATCTA ACAGGTGGGC AGAGGTAAAA
TTTAAGAAGG ATGCTCCTTT ATCTTTGAAT CCAGACTTGA ATGATAACTT TGTTTATATG
GATGAATTCA TAAATTATTT GATAAACAAA TACGGAATGG CTTCTTCACC TACCGGGATA
AAAGGGTATA TACTTGATAA TGAGCCTGAT TTGTGGGTCT CAACACATCC CCGTATACAT
CCTAATAAGG TCACATGCAA AGAGTTGATT GATAAATCTG TTGAACTGGC AAAAGTTATA
AAAACCCTTG ATCCATCAGC TGAAGTTTTT GGATATGCAT CATATGGGTT TATGGGTTAT
TATAGTCTCC AAGATGCGCC TGATTGGAAC CAAGTTAAAG GAGATCATAG ATGGTTTATA
AGCTGGTATC TGGAACAGAT GAAAAAAGCA TCAGACAGTT ATGGAAAAAG ATTATTAGAT
GTGCTTGATT TACACTGGTA TCCAGAAGCA CGAGGTGGAA ATATTCGCGT GTGCTTTGAT
GGCGAAAATG ACACATCAAA AGAAGTTGCT ATAGCTAGGA TGCAAGCTCC AAGAACACTA
TGGGACCCGA CCTACAAAAC ATCAGTGAAA GGGCAAATTA CAGCTGGTGA GAACAGCTGG
ATAAACCAGT GGTTTTCAGA TTATTTGCCT ATAATTCCAA ACATAAAAGC GGACATAGAG
AAATATTATC CTGGTACAAA ACTTGCTATT AGCGAATTCG ATTATGGCGG TCGAAATCAT
ATTTCAGGGG GAATTGCTTT AGCTGATGTG CTCGGTATAT TTGGTAAATA TGGAGTGTAC
TTTGCAGCAA GATGGGGCGA TTCTGGTAGT TATGCAGCAG CTGCATATAA CATTTATCTT
AATTATGATG GAAAAGGCTC AAAATATGGC AATACAAATG TAGGTGCTAA TACAAATGAT
GTTGAAAATA TGCCAGTTTA TGCTTCAATA AATGGACAGG ATGATTCTGA ACTTCATATT
ATACTAATAA ACAGAAACTA TGACAGAAAA TTGCCTGCGA AGATCAGCAT TACAAGTTCA
AAAAACTATA CAAAAGCAGA AATTTATGGT TTTGATAGCA ATAGTCCTAC TGTTAGAAAA
ATGGGAAGTG TGGATAATAT CGAAAACAAT GTTTTAACTC TTGAGGTACC TAATTTAACA
GTTTTCCATA TCGTTTTATA TTCAACCTCA GTACAAACTA AATAA
 
Protein sequence
MRVKTKMGKK WLSILCTVVF LLNILFIANV TNLPKVGAAT SNDGVVKIDT STLIGTNHAH 
CWYRDKLETA LRGIRSWGMN SVRVVLSNGY RWTKIPASEV ANIISLSRSL GFRAIVLEVH
DTTGYGEDGA ACSLAQAVEY WKEIKSVLEG NEDFVIINIG NEPYGNNNYQ NWINDTKNAI
KALRDAGFKH TIMVDAPNWG QDWSNTMRDN AQSIMEADPL RNLVFSIHMY GVYNTASKVE
EYIKSFVEKG LPLVIGEFGH QHTDGDPDEE AIVRYAKQYK IGLFSWSWCG NSSYVGYLDM
VNNWDPNNPT PWGQWYKTNA IGASSVPTST PTPTPTATPT ATPTPTPTPS STPVAGGQIK
VLYANKETNS TTNTIRPWLK VVNTGSSSID LSRVTIRYWY TVDGDKAQSA ISDWAQIGAS
NVTFKFVKLS SSVSGADYYL EIGFKSGAGQ LQAGKDTGEI QIRFNKSDWS NYNQGNDWSW
MQSMTSYGEN VKVTAYIDGV LVWGQEPSGA TPTPTATPAP TVTPTATPAP TPTPTPTPTA
TPTPTPTPTP TPTATPTPTP SSTPVAGGQI KVLYANKETN STTNTIRPWL KVVNTGSSSI
DLSRVTIRYW YTVDGDKAQS AISDWAQIGA SNVTFKFVKL SSSVSGADYY LEIGFKSGAG
QLQAGKDTGE IQIRFNKSDW SNYNQGNDWS WMQSMTSYGE NVKVTAYIDG VLVWGQEPSG
ATPTPTATPA PTSTSTPTPT VTPTPTPTPT ATPTPTATSI PLPTVSPSSA VIEIAINTNK
DRSPISPYIY GANQDIGGVV HPARRLGGNR LTGYNWENNF SNAGNDWYHS SDDYLCWSMG
ISGEDAKVPA AVVSKFHEYS LKNNAYSAIT LQMAGYVSKD NYGTVSENET APSNRWAEVK
FKKDAPLSLN PDLNDNFVYM DEFINYLINK YGMASSPTGI KGYILDNEPD LWVSTHPRIH
PNKVTCKELI DKSVELAKVI KTLDPSAEVF GYASYGFMGY YSLQDAPDWN QVKGDHRWFI
SWYLEQMKKA SDSYGKRLLD VLDLHWYPEA RGGNIRVCFD GENDTSKEVA IARMQAPRTL
WDPTYKTSVK GQITAGENSW INQWFSDYLP IIPNIKADIE KYYPGTKLAI SEFDYGGRNH
ISGGIALADV LGIFGKYGVY FAARWGDSGS YAAAAYNIYL NYDGKGSKYG NTNVGANTND
VENMPVYASI NGQDDSELHI ILINRNYDRK LPAKISITSS KNYTKAEIYG FDSNSPTVRK
MGSVDNIENN VLTLEVPNLT VFHIVLYSTS VQTK