Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_0174 |
Symbol | |
ID | 7407165 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | + |
Start bp | 212384 |
End bp | 215239 |
Gene Length | 2856 bp |
Protein Length | 951 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 643714576 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002572099 |
Protein GI | 222528217 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.01491 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGTCTGGGT TTTTAACAAT TCTGAAAAGG ACAAAATTTG TCATTGTATA TTTACTAGTC ATATTACTCA TATTGCTCTG TAAAATCTCA GCATTTGCTT CAAGTGAAGT ACCTACCCTT CAAGATTACC TAAAAAAAGT GGGAAATGTT GAAAGACCAA AGAAAGAAAT TATAATTGAG GCAATTAATT ACACAAGTTC AAGTAATGCA AGTGTCAGAA AAATATCACA ATTCTATGGG CAAAAAGATG TGCTTTTATG GGAAAAAGAA GGTGGATGGT TAGAGTGGAG TGTAAATATT CCGGAAGATG GATTTTATAA TATGGCTCTT CTCTATTATC CTTTGCCAGG TAAAGGGCTG GGGATAGAAT TTAGCGTATT TATTGATGGA AAAATTCCTT ATAAAGAAGC CCAAAAAGTT ACATTTCCAA GAGTATGGAA GGATACAACA GGAATTAGGA AAGATAAAAA AGGCAATGAT TTGAGACCAA AATGTGATGA ACACCAGCAA TGGCAAAAAA TTGATTTTAT AGATACTGAG GGTTTCTATA ACAAAGCTTT ACCGTTCTAT TTTACAAAAG GCGAACATAA AATTAGACTT ATAAGTATTA GAGAGCCTAT CGCATTGAAG CAGTTGATTA TATATAACAG TGAGGAATTA CCAACTTATG AGGAGTACAT ATCAAAAAGT CGTGAAAAAA ATTCAAAAAA TGTGTTTATA AAAATTCAAG GTGAGAATAC ATATCTAAAA TCTGATCCAA TACTATATCC TACTTATGAT AGAACCGACC CCGCAACAGA GCCTTATCAT GTTTCTAAAA TAAGACTTAA CACAATAGGT CAGTGGAATT GGCGTTATCC GGGGATGTGG ATAAGCTGGA TTTTTGAAGT TCCGCAGGAT GGATATTATA AAATTGCAAT AAAAGCAAGA CAGAATTTTG TCCGAGGGTT ATCTGTTCAC AGAAAGTTAT ATATTGACGG AAAAATTCCT TTCAAAGAAG CTGAGGATAT TGAATTTCCA TATAGTATTA GCTGGTATAT GAAAACAATA GGCAAGAAAA ATCAACCATA TCTAATATAT TTGAAAAAAG GTGTTCATGA ATTAAGGTTG GAATCAACCC TGGGTGCTTT TTCAGAGATT TTGAGTAGAG TTGAAAGTAC CACAATAGAT TTAAACAATT TGTATAGAAA AATAATAATG ATTACAGGCA CATCACCTGA CTTGTATCGT GACTATTTCC TTGAAGAGCA AATACCAGAG CTTGTCAGTA CATTAAAAAG ATTAAGCAAT GAGCTGGAAG AAGAAGCTGC TATGTTTGAA AAACTTGCAG GGCAAAAAGG CGGAGAAGCT GAGTTTTTAA GAAGAGTTGC TCTTCAACTC AGGAGTATGG CTGAAGATAC TGACACAATA CCTGGAAGAT TAACAAGTTT TAGAGACAAT TTGAGTGGAT TATCTTCGTG GCTTGCATAT AGAAGAGATC AGCCATTAGA GATAGATTAT ATTTTGATTA CATCTCCTGA GGAAAAGCTG CCCTCGCCGA CAGCTTCTAT TGGTAATAAG ATTGTTAATT CAATAAAAGC ATTTTTGTAT TCCTTTGTTG AAGATTACAA TAACGTAGGT GAAGTGTATC AGGGGCAAAA GGTTATTAAA GTTTGGGTTG GCGGTGGTCG CGATCAGGCG CAGATTATAA GAGATCTAAT CAATGATTCA TTTACACCAC AGACAGGAAT TAAAGTAAAT GTTAGCCTGG TTCAAGCAGG ATTAATTGAA GCAATACTTG CAGGAAAAGG TCCAGATATA GTTTTAACAG TTTCAAGGGC ACAACCAGTT AATTTAGCCG CACGTGGTGC ACTTGTTGAT TTGAGCAAAT TTAAAGATTT TAATGAAGTT AAAAAAAGGT TTGCTAAAAC TGCTTTAGTT CCATATACGT ATAATGGCGG TGTATATGGG CTTCCAGTTA CTCAGGATTT TTATATGATG TTTTACAGAA AAGATATCTT AAAAGAGCTA AATATTGAAT TGCCACGAAC ATGGGATGAT ATGTACAAAG TCATAGCAAA GCTTCAGAGA TATAATCTTC AGGTTGGTCT TCCATATCAA AGGATTGACG CTCTTGAAGC AATTGATGCG GGGCTTGGTG CAAGAAATCT CTTTCCTACA TTATTGCTTC AGTTTGGTGG AAGCTTTTAT GACAAAACAA AAACACGAAC ACTATTGGAT AGACCAGAAG CTGTAGCTGC ATTTAAGACT TGGACAGATT TTTACACAAA GTACAATCTT CCTTTGATAT ATGACTTTTA CAACAGATTC AGAACAGGTG AGATGCCACT TGGAATAGCA CCATATACCA CGTATAACCT GTTATCGACA GCTGCACCTG AAATTCGAAA TGAATGGGGA ATGGCACCAA TACCTGGGGT AAAAAAGCCA AACGGTGAAA TAGACCGTTC TACAGGTGGG TCAGGTACAG CATGTATAAT ATTAAAGAAA AGCAGAAATA AAGAAGCATG CTGGGAGTTT TTGAAATGGT GGACATCTGA TGAAATTCAA ACACAGTTTG GGAAAGAGCT TGAGATGCTG ATGGGTACTG CTGCAAGATA CAATACAGCA AATTTGAGAG CTTTTCAAAG ACTTCCATGG AACAAAGAGG AGATAGAAAA TTTAGAGACA CAGTGGAAAT ATGTAAAAGA AATAGAGGAA GTTCCAGGAA GTTATTACAT TACAAGAAGT ATAGACAGTG CATTTTCAGC TGTTGTTTAT CAGGGGATAA ATCCAAGAGA AAGTATGTGG AAATATACAA AAGAAATCAA CGATGAGCTT GAAAGAAAGA GGATAGAGCT TAGTTTGAAT AAATAA
|
Protein sequence | MSGFLTILKR TKFVIVYLLV ILLILLCKIS AFASSEVPTL QDYLKKVGNV ERPKKEIIIE AINYTSSSNA SVRKISQFYG QKDVLLWEKE GGWLEWSVNI PEDGFYNMAL LYYPLPGKGL GIEFSVFIDG KIPYKEAQKV TFPRVWKDTT GIRKDKKGND LRPKCDEHQQ WQKIDFIDTE GFYNKALPFY FTKGEHKIRL ISIREPIALK QLIIYNSEEL PTYEEYISKS REKNSKNVFI KIQGENTYLK SDPILYPTYD RTDPATEPYH VSKIRLNTIG QWNWRYPGMW ISWIFEVPQD GYYKIAIKAR QNFVRGLSVH RKLYIDGKIP FKEAEDIEFP YSISWYMKTI GKKNQPYLIY LKKGVHELRL ESTLGAFSEI LSRVESTTID LNNLYRKIIM ITGTSPDLYR DYFLEEQIPE LVSTLKRLSN ELEEEAAMFE KLAGQKGGEA EFLRRVALQL RSMAEDTDTI PGRLTSFRDN LSGLSSWLAY RRDQPLEIDY ILITSPEEKL PSPTASIGNK IVNSIKAFLY SFVEDYNNVG EVYQGQKVIK VWVGGGRDQA QIIRDLINDS FTPQTGIKVN VSLVQAGLIE AILAGKGPDI VLTVSRAQPV NLAARGALVD LSKFKDFNEV KKRFAKTALV PYTYNGGVYG LPVTQDFYMM FYRKDILKEL NIELPRTWDD MYKVIAKLQR YNLQVGLPYQ RIDALEAIDA GLGARNLFPT LLLQFGGSFY DKTKTRTLLD RPEAVAAFKT WTDFYTKYNL PLIYDFYNRF RTGEMPLGIA PYTTYNLLST AAPEIRNEWG MAPIPGVKKP NGEIDRSTGG SGTACIILKK SRNKEACWEF LKWWTSDEIQ TQFGKELEML MGTAARYNTA NLRAFQRLPW NKEEIENLET QWKYVKEIEE VPGSYYITRS IDSAFSAVVY QGINPRESMW KYTKEINDEL ERKRIELSLN K
|
| |