Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_0614 |
Symbol | |
ID | 7406955 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | + |
Start bp | 697213 |
End bp | 698520 |
Gene Length | 1308 bp |
Protein Length | 435 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 643714995 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002572511 |
Protein GI | 222528629 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAGAGGC TTGGAAAAAT TGTAGCGTTG GTAACTTTGG TTGTTTTTAT GTTGAGCTTA GTTTTAGTTT TTTCACCATC TGAAAAGGCT TTCAGCTCAG CTGGTGGAAC AAAGAAGATT GTATTTTGGC ACATCACAAC TGATGCTGTC GGAAAGCAAA TGATTCAAAA CCAAGTAAAC AGATTTTTGA AAGCGCATCC AGACTTTAAG GTTGAAGTAG TTCCGCTACA GAATGATTCA TTCAAGACAA AATTGAAAAT TGCTATGGGT TCTAACCAGG CACCAGATGT ATTTGTTACA TGGGGTGGCG GTCCTTTCTT TGAATATGTA AGGGCTGGTA AGGTAAAAGA TATTACAGCT TATATGAACG AGAAAAATTA TAAGAGCAGA TTTGTTGATG CAGCATTTGG GCCAATTACT TATAACAATA AGATATGGGC AGTTCCAGTT GAAGGCGCTG GTGTTGCGCT TGTATGGTAC AATAAGGAAA TATTCAAAAA ATACAATTTG AAAGTACCAA CGACTTATGA TGAGTTTCTA AATATTGTAA AGGTATTAAA ATCAAAAGGA ATAACTCCAA TAACTCTTGC TAATAAAACA AAATGGCCTG GTTGTTTCTG GTACTGGAAT TTGGTAAATA AATTAGGTGG TGCTTCAGCG TTTGAAAAAG CTGCTGAGAG AACAGGTGGT AGCTTTGCAG ATGAACCGTT TGTGAAAGCA GGTCAAATGC TTCAGGATTT GGTCAAAATG GGGGCATTCC CAAAAGGATT TAATGGACTT GACTGGGATA CGGGACAATC AAGAATGCTT TTATACTCTG GTAAAGCTGC AATGGAATTA ATGGGAACTT GGCAGATACC TATAGTAAAG GCTGAAAAGA AAGAATTTTA TGAAAAGAAT CTTGATTTCT TCCCATTCCC ATCGATTAAG GGTGGGAAAG GAGATCCAAC AAGTTTAGTT GGGATGGCGT CTTCAAACTA CTATGCTGTG ACAACTACAT GTAAGTATCC AAAAGAAGCA TTCAATATGA TTCAATACTT GATTGATGAT CAGGCTGTAA AAGAGAGAAT TAACTATGGA CAAATTCCTC CTGTAAAAGG ATTAAAGTTT TCTGATCCAA TGCTTCAAAA GGTTTACAAC ATAGTTGCAA AAGCAAAGAG TATTCAGCTT TGGTGGGATC AATACCTTCC ACCACAACTT GCAGAGACTC ATAAGGATAT TGTGCAATCT CTCTTTGGCT TGACAATTAC ACCAAAAGCT GCTGCAGAAA AAATGGAGAA AGCTGCTAAG GAATATTTCA AGAAGTAA
|
Protein sequence | MKRLGKIVAL VTLVVFMLSL VLVFSPSEKA FSSAGGTKKI VFWHITTDAV GKQMIQNQVN RFLKAHPDFK VEVVPLQNDS FKTKLKIAMG SNQAPDVFVT WGGGPFFEYV RAGKVKDITA YMNEKNYKSR FVDAAFGPIT YNNKIWAVPV EGAGVALVWY NKEIFKKYNL KVPTTYDEFL NIVKVLKSKG ITPITLANKT KWPGCFWYWN LVNKLGGASA FEKAAERTGG SFADEPFVKA GQMLQDLVKM GAFPKGFNGL DWDTGQSRML LYSGKAAMEL MGTWQIPIVK AEKKEFYEKN LDFFPFPSIK GGKGDPTSLV GMASSNYYAV TTTCKYPKEA FNMIQYLIDD QAVKERINYG QIPPVKGLKF SDPMLQKVYN IVAKAKSIQL WWDQYLPPQL AETHKDIVQS LFGLTITPKA AAEKMEKAAK EYFKK
|
| |