Gene Athe_0174 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_0174 
Symbol 
ID7407165 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp212384 
End bp215239 
Gene Length2856 bp 
Protein Length951 aa 
Translation table11 
GC content35% 
IMG OID643714576 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002572099 
Protein GI222528217 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.01491 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGTCTGGGT TTTTAACAAT TCTGAAAAGG ACAAAATTTG TCATTGTATA TTTACTAGTC 
ATATTACTCA TATTGCTCTG TAAAATCTCA GCATTTGCTT CAAGTGAAGT ACCTACCCTT
CAAGATTACC TAAAAAAAGT GGGAAATGTT GAAAGACCAA AGAAAGAAAT TATAATTGAG
GCAATTAATT ACACAAGTTC AAGTAATGCA AGTGTCAGAA AAATATCACA ATTCTATGGG
CAAAAAGATG TGCTTTTATG GGAAAAAGAA GGTGGATGGT TAGAGTGGAG TGTAAATATT
CCGGAAGATG GATTTTATAA TATGGCTCTT CTCTATTATC CTTTGCCAGG TAAAGGGCTG
GGGATAGAAT TTAGCGTATT TATTGATGGA AAAATTCCTT ATAAAGAAGC CCAAAAAGTT
ACATTTCCAA GAGTATGGAA GGATACAACA GGAATTAGGA AAGATAAAAA AGGCAATGAT
TTGAGACCAA AATGTGATGA ACACCAGCAA TGGCAAAAAA TTGATTTTAT AGATACTGAG
GGTTTCTATA ACAAAGCTTT ACCGTTCTAT TTTACAAAAG GCGAACATAA AATTAGACTT
ATAAGTATTA GAGAGCCTAT CGCATTGAAG CAGTTGATTA TATATAACAG TGAGGAATTA
CCAACTTATG AGGAGTACAT ATCAAAAAGT CGTGAAAAAA ATTCAAAAAA TGTGTTTATA
AAAATTCAAG GTGAGAATAC ATATCTAAAA TCTGATCCAA TACTATATCC TACTTATGAT
AGAACCGACC CCGCAACAGA GCCTTATCAT GTTTCTAAAA TAAGACTTAA CACAATAGGT
CAGTGGAATT GGCGTTATCC GGGGATGTGG ATAAGCTGGA TTTTTGAAGT TCCGCAGGAT
GGATATTATA AAATTGCAAT AAAAGCAAGA CAGAATTTTG TCCGAGGGTT ATCTGTTCAC
AGAAAGTTAT ATATTGACGG AAAAATTCCT TTCAAAGAAG CTGAGGATAT TGAATTTCCA
TATAGTATTA GCTGGTATAT GAAAACAATA GGCAAGAAAA ATCAACCATA TCTAATATAT
TTGAAAAAAG GTGTTCATGA ATTAAGGTTG GAATCAACCC TGGGTGCTTT TTCAGAGATT
TTGAGTAGAG TTGAAAGTAC CACAATAGAT TTAAACAATT TGTATAGAAA AATAATAATG
ATTACAGGCA CATCACCTGA CTTGTATCGT GACTATTTCC TTGAAGAGCA AATACCAGAG
CTTGTCAGTA CATTAAAAAG ATTAAGCAAT GAGCTGGAAG AAGAAGCTGC TATGTTTGAA
AAACTTGCAG GGCAAAAAGG CGGAGAAGCT GAGTTTTTAA GAAGAGTTGC TCTTCAACTC
AGGAGTATGG CTGAAGATAC TGACACAATA CCTGGAAGAT TAACAAGTTT TAGAGACAAT
TTGAGTGGAT TATCTTCGTG GCTTGCATAT AGAAGAGATC AGCCATTAGA GATAGATTAT
ATTTTGATTA CATCTCCTGA GGAAAAGCTG CCCTCGCCGA CAGCTTCTAT TGGTAATAAG
ATTGTTAATT CAATAAAAGC ATTTTTGTAT TCCTTTGTTG AAGATTACAA TAACGTAGGT
GAAGTGTATC AGGGGCAAAA GGTTATTAAA GTTTGGGTTG GCGGTGGTCG CGATCAGGCG
CAGATTATAA GAGATCTAAT CAATGATTCA TTTACACCAC AGACAGGAAT TAAAGTAAAT
GTTAGCCTGG TTCAAGCAGG ATTAATTGAA GCAATACTTG CAGGAAAAGG TCCAGATATA
GTTTTAACAG TTTCAAGGGC ACAACCAGTT AATTTAGCCG CACGTGGTGC ACTTGTTGAT
TTGAGCAAAT TTAAAGATTT TAATGAAGTT AAAAAAAGGT TTGCTAAAAC TGCTTTAGTT
CCATATACGT ATAATGGCGG TGTATATGGG CTTCCAGTTA CTCAGGATTT TTATATGATG
TTTTACAGAA AAGATATCTT AAAAGAGCTA AATATTGAAT TGCCACGAAC ATGGGATGAT
ATGTACAAAG TCATAGCAAA GCTTCAGAGA TATAATCTTC AGGTTGGTCT TCCATATCAA
AGGATTGACG CTCTTGAAGC AATTGATGCG GGGCTTGGTG CAAGAAATCT CTTTCCTACA
TTATTGCTTC AGTTTGGTGG AAGCTTTTAT GACAAAACAA AAACACGAAC ACTATTGGAT
AGACCAGAAG CTGTAGCTGC ATTTAAGACT TGGACAGATT TTTACACAAA GTACAATCTT
CCTTTGATAT ATGACTTTTA CAACAGATTC AGAACAGGTG AGATGCCACT TGGAATAGCA
CCATATACCA CGTATAACCT GTTATCGACA GCTGCACCTG AAATTCGAAA TGAATGGGGA
ATGGCACCAA TACCTGGGGT AAAAAAGCCA AACGGTGAAA TAGACCGTTC TACAGGTGGG
TCAGGTACAG CATGTATAAT ATTAAAGAAA AGCAGAAATA AAGAAGCATG CTGGGAGTTT
TTGAAATGGT GGACATCTGA TGAAATTCAA ACACAGTTTG GGAAAGAGCT TGAGATGCTG
ATGGGTACTG CTGCAAGATA CAATACAGCA AATTTGAGAG CTTTTCAAAG ACTTCCATGG
AACAAAGAGG AGATAGAAAA TTTAGAGACA CAGTGGAAAT ATGTAAAAGA AATAGAGGAA
GTTCCAGGAA GTTATTACAT TACAAGAAGT ATAGACAGTG CATTTTCAGC TGTTGTTTAT
CAGGGGATAA ATCCAAGAGA AAGTATGTGG AAATATACAA AAGAAATCAA CGATGAGCTT
GAAAGAAAGA GGATAGAGCT TAGTTTGAAT AAATAA
 
Protein sequence
MSGFLTILKR TKFVIVYLLV ILLILLCKIS AFASSEVPTL QDYLKKVGNV ERPKKEIIIE 
AINYTSSSNA SVRKISQFYG QKDVLLWEKE GGWLEWSVNI PEDGFYNMAL LYYPLPGKGL
GIEFSVFIDG KIPYKEAQKV TFPRVWKDTT GIRKDKKGND LRPKCDEHQQ WQKIDFIDTE
GFYNKALPFY FTKGEHKIRL ISIREPIALK QLIIYNSEEL PTYEEYISKS REKNSKNVFI
KIQGENTYLK SDPILYPTYD RTDPATEPYH VSKIRLNTIG QWNWRYPGMW ISWIFEVPQD
GYYKIAIKAR QNFVRGLSVH RKLYIDGKIP FKEAEDIEFP YSISWYMKTI GKKNQPYLIY
LKKGVHELRL ESTLGAFSEI LSRVESTTID LNNLYRKIIM ITGTSPDLYR DYFLEEQIPE
LVSTLKRLSN ELEEEAAMFE KLAGQKGGEA EFLRRVALQL RSMAEDTDTI PGRLTSFRDN
LSGLSSWLAY RRDQPLEIDY ILITSPEEKL PSPTASIGNK IVNSIKAFLY SFVEDYNNVG
EVYQGQKVIK VWVGGGRDQA QIIRDLINDS FTPQTGIKVN VSLVQAGLIE AILAGKGPDI
VLTVSRAQPV NLAARGALVD LSKFKDFNEV KKRFAKTALV PYTYNGGVYG LPVTQDFYMM
FYRKDILKEL NIELPRTWDD MYKVIAKLQR YNLQVGLPYQ RIDALEAIDA GLGARNLFPT
LLLQFGGSFY DKTKTRTLLD RPEAVAAFKT WTDFYTKYNL PLIYDFYNRF RTGEMPLGIA
PYTTYNLLST AAPEIRNEWG MAPIPGVKKP NGEIDRSTGG SGTACIILKK SRNKEACWEF
LKWWTSDEIQ TQFGKELEML MGTAARYNTA NLRAFQRLPW NKEEIENLET QWKYVKEIEE
VPGSYYITRS IDSAFSAVVY QGINPRESMW KYTKEINDEL ERKRIELSLN K