Gene Athe_1865 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_1865 
Symbol 
ID7408978 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp1959955 
End bp1964064 
Gene Length4110 bp 
Protein Length1369 aa 
Translation table11 
GC content43% 
IMG OID643716237 
Productglycoside hydrolase family 9 
Protein accessionYP_002573726 
Protein GI222529844 
COG category[R] General function prediction only 
COG ID[COG3889] Predicted solute binding protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.56874 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTTAAAC TAAAAAGAGC AATAAAAATG ATTACATTTT GTGTTGCTAT GGTATTTCTA 
TTGCAGGTTT TCTTTCTATT TTCAGGATAT AATAACAGTG AAGTAAAAGC AGCAACAACC
TTTAACTATG GTGAAGCTCT TCAAAAAGCG ATCATGTTTT ATGAATTTCA GATGTCAGGT
AAACTACCAT CATGGATCCG TAACAACTGG CGCGGGGATT CTGGTCTAAA TGATGGCAAA
GATGTAGGTT TAGATCTTAC TGGTGGCTGG CATGATGCGG GCGACCATGT AAAGTTTAAT
CTACCAATGT CATACAGTGC ATCAATGCTT TCGTGGGCAG TTTATGAGTA CAAAGCAGCA
TTTGAGAAAA GTGGTCAGCT TGAACATATA CTTAACCAGA TTGAATGGGT AAACGACTAC
TTTGTAAAAT GCCATCCATC AAAGTATGTA TACTACTATC AAGTTGGTGA CCCAATTGAA
GATCATAACT TCTGGGGTCC AGCAGAAGTT ATGCAAATGA AACGACCAGC ATACAAGTGT
GACTTAAATA ATCCAGCAAG TTCGGTTGTT GCAGAAACAG CAGCATCCTT AGCTGCAGCT
TCAATCGTCA TACGTGAAAG AAATAGTCAA AAGGCAGACA CATATTTGCA GCATGCGATG
GTACTCTTTG ATTTTGCCGA TAGAACTCGT AGTGATGCAG GGTATACCGC AGCAACAGGC
TTTTACACAT CAGGTGGTTT TATTGATGAT CTTGGTTGGG CAGCAGTGTG GTTATATCTT
GCGACAAATG ACAAATCATA TTTAGATAAA GCTGAGGCAC TTATGGCAGA ATATGCCGGT
GGCACAAATA CATGGACACA GTGCTGGGAC GATGTAAGAT ACGGAGCAAT ATTGCTTTTA
GCAAAAATTA CTAATAAAGA CATATATAAA GGTGCTGTTG AAAGAAATCT TGATCATTGG
ACATATAACA TAACCTATAC ACCTAAAGGT CTTGCATGGA TAACAGGGTG GGGCTCACTT
AGGTATGCCA CAACTGCAGC TTTCTTAGCG TTTGTTTATG CAGATTGGTC AGGATGTCCA
GAAAATAAGC GAACAGCTTA TCTAAAATTT GGTGAGAGTC AGATTAACTA TGCATTAGGT
TCAACAGGAA GAAGCTTTTT GGTAGGATTT GGGCAAAATT ATCCACAACA TCCACATCAC
AGAAATGCAC ACAGTTCATG GGCGAACAGT ATGCGAATAC CTGAATATCA TCGACACATA
CTTTATGGTG CATTAGTAGG CGGACCAGGC TCTGATGATA GTTACAATGA TGATATTACT
GACTATGTTC AAAACGAGGT GGCTTGTGAC TACAATGCTG GTATTGTAGG TGCTCTGGCA
AAAATGTACC TTATGTATGG AGGAGACCCA ATACCTAATT TCAAAGCTAT CGAAAAGCCA
ACTAATGATG AAATTTTTGT TGAATCCAAG TTTGGTAATT CACAGGGTAC AAACTATACC
GAAATAATTT CATACATTTA TAACAGAACG GGATGGCCGC CTCGAGTCAC AGATAATCTA
AACTTTAAGT ATTTTATTGA CCTAAGTGAG TTAATCAAGG CTGGGTATGG TCCTGATGTT
GTTAAAGTAG AGACATATTA TTCAGAAGGT GGAAAAATAT CTGGACCATA CGTATGGAAT
GCATCAAAGA ACCTTTACTA TATATTAGTT GATTTTACAG GAACAAAAAT ATATCCAGGT
GGGGAAGTAG AACACAAAAA ACAAGCTCAA TTTAAGATAT CTGTGCCACA AGGTGTTCCA
TGGGATCCAA CTAATGACCC ATCTTATGCA GGATTAACAA AAGAACTTAG TAAAAATAAG
TTCATAGCAG CTTATGAAGG TAACGTGCTG GTATGGGGAC AAGAACCAGA GGGTTCGTCA
AGTTCAACCC CAACCCCAAC ACCAACACCA ACACCAACAC TGACTCCAAC ACCGACATCA
ACTGCTACAC CAACACCGAC ACCTACACCA ACACCAACGT CAACACCAAC TGCTACACCA
ACAGCAACGC CAACACCAAC ACCGACGCCG AGCAGCACAC CTGTAGCAGG CGGGCAGATA
AAGGTATTGT ATGCTAACAA GGAGACAAAT AGCACAACAA ACACGATAAG GCCATGGTTG
AAGGTAGTGA ACACTGGAAG CAGCAGCATA GATTTAAGCA GGGTAACGAT AAGGTACTGG
TACACGGTAG ATGGGGACAA GGCACAGAGT GCGATATCAG ACTGGGCACA GATAGGAGCA
AGCAATGTGA CATTCAAGTT TGTGAAGCTG AGCAGTAGCG TAAGTGGAGC GGACTATTAT
TTAGAGATAG GATTTAAGAG TGGAGCTGGG CAGTTGCAGG CTGGTAAAGA CACAGGGGAG
ATACAGATAA GGTTTAACAA GAGTGACTGG AGCAATTACA ATCAGGGGAA TGACTGGTCA
TGGATGCAGA GCATGACGAG TTATGGAGAG AATGTGAAGG TAACAGCGTA TATAGATGGT
GTATTGGTAT GGGGACAGGA GCCGAGTGGA GCGACACCAA CACCGACAGC AACACCAGCA
CCGACAGTGA CACCGACACC AACACCAGCA CCAACACCAA CCCCGACTCC AACACCAACT
GCTACACCAA CGCCAACACC GACTCCAACA CCAACACCAA CTGCTACCCC AACACCGACG
CCGAGCAGCA CACCTGTAGC AGGTGGACAG ATAAAGGTAT TGTATGCTAA CAAGGAGACA
AATAGCACAA CAAACACGAT AAGGCCATGG TTGAAGGTAG TGAACACTGG AAGCAGCAGC
ATAGATTTAA GCAGGGTAAC GATAAGGTAC TGGTACACGG TAGATGGGGA CAAGGCACAG
AGTGCGATAT CAGACTGGGC ACAGATAGGA GCAAGCAATG TGACATTCAA GTTTGTGAAG
CTGAGCAGTA GCGTAAGTGG AGCGGACTAT TATTTAGAGA TAGGATTTAA GAGTGGAGCT
GGGCAGTTGC AGGCTGGTAA AGACACAGGG GAGATACAGA TAAGGTTTAA CAAGAGTGAC
TGGAGCAATT ACAATCAGGG GAATGACTGG TCATGGATGC AGAGCATGAC GAGTTATGGA
GAGAATGTGA AGGTAACAGC GTATATAGAT GGTGTATTGG TATGGGGACA GGAGCCGAGT
GGAGCGACAC CAACACCGAC AGCAACACCA GCACCGACAG TGACACCTAC ACCTACACCA
ACTCCAACTC CAACGCCGAG CAGTGGAATA GTGAAGATAG ATACTAGCAC ATTAATAGGA
ACAAATCACG CACATTGCTG GTACAGAGAT AAACTTGAGA CGGCATTGCG AGGAATAAGG
TCATGGGGTA TGAACTCTGT GAGGGTAGTG TTGAGTAATG GCTATCGATG GACGAAGATA
CCAGCAAGTG AAGTAGCAAA TATTATATCA TTGTCAAGAA GTCTTGGATT CAGAGCCATT
GTATTAGAAG TTCACGACAC GACAGGATAT GGTGAGGACG GTGCAGCATG TTCATTGGCG
CAAGCAGTAG AATATTGGAA AGAGATAAAG AGTGTGTTAG AAGGCAATGA GGATTTTGTT
ATAATAAACA TTGGTAATGA GCCGTATGGG AACAATAACT ATCAAAACTG GATTAATGAC
ACGAAGAATG CTATAAAAGC GCTAAGGGAT GCAGGGTTCA AGCACACGAT AATGGTTGAT
GCACCGAACT GGGGGCAGGA TTGGTCTAAT ACTATGAGAG ACAATGCCCA GAGCATAATG
GAAGCAGATC CGCTGCGCAA TTTGGTATTT TCGATTCATA TGTACGGTGT ATACAATACA
GCGAGCAAGG TAGAAGAATA TATCAAGTCA TTTGTGGAGA AAGGGCTGCC ATTAGTTATT
GGGGAGTTTG GGCATCAGCA TACAGATGGT GACCCTGACG AGGAAGCTAT TGTCAGGTAT
GCAAAACAAT ACAAGATAGG ACTTTTTAGC TGGTCTTGGT GTGGCAATTC GAGCTATGTA
GGGTACTTGG ACATGGTAAA CAATTGGGAC CCCAATAATC CAACTCCATG GGGGCAATGG
TATAAAACTA ATGCGATTGG TGCTGAATAA
 
Protein sequence
MLKLKRAIKM ITFCVAMVFL LQVFFLFSGY NNSEVKAATT FNYGEALQKA IMFYEFQMSG 
KLPSWIRNNW RGDSGLNDGK DVGLDLTGGW HDAGDHVKFN LPMSYSASML SWAVYEYKAA
FEKSGQLEHI LNQIEWVNDY FVKCHPSKYV YYYQVGDPIE DHNFWGPAEV MQMKRPAYKC
DLNNPASSVV AETAASLAAA SIVIRERNSQ KADTYLQHAM VLFDFADRTR SDAGYTAATG
FYTSGGFIDD LGWAAVWLYL ATNDKSYLDK AEALMAEYAG GTNTWTQCWD DVRYGAILLL
AKITNKDIYK GAVERNLDHW TYNITYTPKG LAWITGWGSL RYATTAAFLA FVYADWSGCP
ENKRTAYLKF GESQINYALG STGRSFLVGF GQNYPQHPHH RNAHSSWANS MRIPEYHRHI
LYGALVGGPG SDDSYNDDIT DYVQNEVACD YNAGIVGALA KMYLMYGGDP IPNFKAIEKP
TNDEIFVESK FGNSQGTNYT EIISYIYNRT GWPPRVTDNL NFKYFIDLSE LIKAGYGPDV
VKVETYYSEG GKISGPYVWN ASKNLYYILV DFTGTKIYPG GEVEHKKQAQ FKISVPQGVP
WDPTNDPSYA GLTKELSKNK FIAAYEGNVL VWGQEPEGSS SSTPTPTPTP TPTLTPTPTS
TATPTPTPTP TPTSTPTATP TATPTPTPTP SSTPVAGGQI KVLYANKETN STTNTIRPWL
KVVNTGSSSI DLSRVTIRYW YTVDGDKAQS AISDWAQIGA SNVTFKFVKL SSSVSGADYY
LEIGFKSGAG QLQAGKDTGE IQIRFNKSDW SNYNQGNDWS WMQSMTSYGE NVKVTAYIDG
VLVWGQEPSG ATPTPTATPA PTVTPTPTPA PTPTPTPTPT ATPTPTPTPT PTPTATPTPT
PSSTPVAGGQ IKVLYANKET NSTTNTIRPW LKVVNTGSSS IDLSRVTIRY WYTVDGDKAQ
SAISDWAQIG ASNVTFKFVK LSSSVSGADY YLEIGFKSGA GQLQAGKDTG EIQIRFNKSD
WSNYNQGNDW SWMQSMTSYG ENVKVTAYID GVLVWGQEPS GATPTPTATP APTVTPTPTP
TPTPTPSSGI VKIDTSTLIG TNHAHCWYRD KLETALRGIR SWGMNSVRVV LSNGYRWTKI
PASEVANIIS LSRSLGFRAI VLEVHDTTGY GEDGAACSLA QAVEYWKEIK SVLEGNEDFV
IINIGNEPYG NNNYQNWIND TKNAIKALRD AGFKHTIMVD APNWGQDWSN TMRDNAQSIM
EADPLRNLVF SIHMYGVYNT ASKVEEYIKS FVEKGLPLVI GEFGHQHTDG DPDEEAIVRY
AKQYKIGLFS WSWCGNSSYV GYLDMVNNWD PNNPTPWGQW YKTNAIGAE