Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_0597 |
Symbol | |
ID | 7406938 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | + |
Start bp | 673118 |
End bp | 674620 |
Gene Length | 1503 bp |
Protein Length | 500 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 643714980 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002572496 |
Protein GI | 222528614 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAATCTCA AAAAGTTTTT CGTAGTCATG CTTGTTGTGA CTTTTGTACT CACAAGTGTA ATTGGTGTAG TGACGGGATT TGGTGCATCT TCTTCAAAAC TTCCTTATGT TAAGCTTACA TGGTATGTCA TTGGAACACC TCAAAAAGAC TGGGATTTAA TCAATCAAAA AGTAAATGAG TACATCAAAC CAAAGCTTAA TGCTGAAATC AAAATGACAA TGTTTGACTG GGGCGAATAT AATGATAAGC TCCAGACAAA GATTGCAGCA AGTGAGCCAT TTGATATCTG TTTTACAGCA ATCTGGACAA ACAACTACAG AACTAACGTT GCAAAAGGTG CATTTTTGCC GCTCAATAAG CCCGGAAACG ACCTTCTTTC TAAGTATGCA CCAAAGACAA AGAAGCTTCT TGGCGATGAT TTCATAAAAG GTGCATCCAT TAACGGAATT CTGTATGCAA TTCCAGCAAA CAAGGAGAAG GCTCATAACT GGGGATTCAT TGTTAGAATG GACTTGGTAA AGAAGTATAA ATTAGAAGAC ATGTTTAAAA AGGTTAAGAA ATTAGAAGAT TTAGAGCCAT ATCTTAAGGT AATCAAACAA AAAGAGCCAG GTGTATATCC ACTTGGAGCA TATGCTGGTG AGTCGCCAAG ATTCCTTTTA GACTGGGACA AGGTTGTAGA CGATGATGTT CCTGTATCAC TTTATCCGAA TAATAAGAGC ACAAAGATTG TTAATGAACT TGAACAGCCA AATACAAAAG CTCTCTTTAA GACAGTAAGA AAATATTACT TGGCAGGTTA TATCAGAAAA GATGCAGCAA GTGTTACAGA CTGGATGTCT GATTTAAAAG CTGGTAAAGT GTTTGTAATG CCCCAGTCGC TCAAGCCAGG AAAAGATGCT GAGATGTCTA TTTCAACAGG TTATGAATGG AAACAGATAG ATATAACACC ACCTGTTATG TCAACAAGAG AATGTATAGG TTCTATGCAG GCAATCAACG CAAAGTCAAA GAATCCAGAA AGAGCTTTAA TGTTCTTAGA GCTTTTCAAC ACAGACAAGT ATCTTAACAA CCTTGTAAAC TTTGGTATTG AAGGTCAGCA CTATGTATTT AAAGATAAAG CAAGAGGAAT CATAGCTCCA GGACCAAAGG CAAAAGACTA TAGCCCAGGT CTTGGCTGGA TGTTTGGAAA TCAATTTATA AACTATATTT ATGAAAATGA AGATCCTAAC AAATGGAAAA ACTTTGAAGA GTATAACAAG AAGGCACTGC CTCTTCTTAG CCTTGGATTC AACTTTGATG ACTCAAAAGT AAAAACACAG GTTGCAGCAT GCAAGAGCGT ATGGAAGCAG TATATTCCAA TGCTTGAGAC AGGGAGTGTA GACCCTGATA AATACATTCC ACAGGCAATT GACAAGTTCA AGAAAGCAGG TGTTGACATT ATTATAAAAG AGGCACAGAA GCAGTATGAT GAATTTCTGA AGAAGACAGG AAGAAAGAAA TAA
|
Protein sequence | MNLKKFFVVM LVVTFVLTSV IGVVTGFGAS SSKLPYVKLT WYVIGTPQKD WDLINQKVNE YIKPKLNAEI KMTMFDWGEY NDKLQTKIAA SEPFDICFTA IWTNNYRTNV AKGAFLPLNK PGNDLLSKYA PKTKKLLGDD FIKGASINGI LYAIPANKEK AHNWGFIVRM DLVKKYKLED MFKKVKKLED LEPYLKVIKQ KEPGVYPLGA YAGESPRFLL DWDKVVDDDV PVSLYPNNKS TKIVNELEQP NTKALFKTVR KYYLAGYIRK DAASVTDWMS DLKAGKVFVM PQSLKPGKDA EMSISTGYEW KQIDITPPVM STRECIGSMQ AINAKSKNPE RALMFLELFN TDKYLNNLVN FGIEGQHYVF KDKARGIIAP GPKAKDYSPG LGWMFGNQFI NYIYENEDPN KWKNFEEYNK KALPLLSLGF NFDDSKVKTQ VAACKSVWKQ YIPMLETGSV DPDKYIPQAI DKFKKAGVDI IIKEAQKQYD EFLKKTGRKK
|
| |