Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_2554 |
Symbol | |
ID | 7409505 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 2672878 |
End bp | 2674524 |
Gene Length | 1647 bp |
Protein Length | 548 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 643716918 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002574395 |
Protein GI | 222530513 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00103966 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTAAAA GAAGAGTTGC TTTATTGGTT GCTATTGCCT TTTTGATAAC CATCATTGTT CCAGGGTTTT TAAGTACTCC AACAAAGGCA GTAGCAGCAT CTAAAAACCC AATAGTTTTT AAGATTTATT GGGGTGATTC TAATGCAGAG CCGGTTGATG TGTGGAAGAC ACCAATTGGT AAAAAGGTTG AACAAATAAC AGGTGTAAGG CTTCAATTTG AATTTATTGT TGGCAGTGAC GAGGAAACAA AGGCAGGTAT CATGCTTGCA AGTGGCGACT TGCCAGACTT AATCAATGCA CACAATGTTG TAAACAAGTT TATTGAAGCA GGAGCTTTAG TTCCACTGGA TAATTACATT GCTAAGTACG GCAAGAACAT CAAAAAGTGG TATGATACAA AAGCGCTTAA AAAACTCAAA TATCCAAAGG ATGGGCACAT TTACTATCTC ACACCTTTCA GAGAAGAGTC TGACCCGCTC TATTCGTTTG CTGGTTTTTG GCTGCCTATA TACGTTCTGA AAGAAAACAA ATGGCCAGTT GTAAGAGATA TTGACACATA TTTTAAGATT GTAAAAGATA CTGTCAAGAA ACATCCAACA TACAATGGAA AGCCAACGAT AGGTTTTACA GCGCTGACTG ATAGCTGGAG AATTTATGTA CTGATGCAAC AGCCGCTGAG GCTTGAAGGC TATCCGAACG ATGGTGGCTG GCTGATCGAC GAAAAGACTG GAGTTGTAAA AGACAGCTAT ACAATGCCAT ATGCAAAGAC ATATTACAAA ATACTCAACC AGATGTGGAA CGAAGGTCTT CTTGACAAAG AGATGTTCTC ACAAAACTAT GATCAGTACT TAGCAAAGAT TTCGTCAGGT AGGGTTGTTG GTTTTTATGA TGAAAGATGG CAGATACAGT CTGCAATAGA CTCTCTTGAA AAACAAGGAC TTTATGACAG AATTCCAATT GCAATGCCAG TGCTCAAGAA AGGCGTAAAG AGAGATAGAT ACAACGTGGT TACAATGGGA ACAGGTGCCG GAATATCAAT TACAAAAAAG TGCAAGGACC CGGTTGCAGC CTTCAAGTTC TTGGACAGAA TGGCTGGCGA GGATATCTTG AAACTCATCA ACTGGGGTAT CCAAGGCCAG GATTACTATG TAAAGAATGG TAAAATGTAC AAGGATGCAA AACAGATTCA AAACTACATG AACCCAGATT ACAGAAAGAA ACAGGGCATT GGCGGAAATA TCTGGTTTGC ATTCCCAAGA CCACCGTTTG ACTGGACATA TTCAGACAAG AGCGGAAAGA TTTCTTGGGA CTATTCAGAC CAGGCATTAG AGCAGAGGTA CAAACCATAT GAAAAGGAAG TTTTGAAAGC TTATAAGATT AAGTCGTTCA AAGACTTGTT CTCACCAACA TGGAACTCAC CGTACGGATA TGGCTGGGAT ATCAAGCTCC CAGACGACCT GCAGGCAATC CAGAACCAGG CTGATGACTT GCAGAGAAGG TACATCACAA AAGCTATAAT GGCAAAACCG GGTGATTACG ATAAGATTTG GAATGAATAC CTTAGCAAGA TGAAGAACAT TCCTATCAAG AAGGTAATTG ACTTTAGACA AAAAGAGATC CAGAGAAGAC TCAAAGAGTG GAACTAA
|
Protein sequence | MFKRRVALLV AIAFLITIIV PGFLSTPTKA VAASKNPIVF KIYWGDSNAE PVDVWKTPIG KKVEQITGVR LQFEFIVGSD EETKAGIMLA SGDLPDLINA HNVVNKFIEA GALVPLDNYI AKYGKNIKKW YDTKALKKLK YPKDGHIYYL TPFREESDPL YSFAGFWLPI YVLKENKWPV VRDIDTYFKI VKDTVKKHPT YNGKPTIGFT ALTDSWRIYV LMQQPLRLEG YPNDGGWLID EKTGVVKDSY TMPYAKTYYK ILNQMWNEGL LDKEMFSQNY DQYLAKISSG RVVGFYDERW QIQSAIDSLE KQGLYDRIPI AMPVLKKGVK RDRYNVVTMG TGAGISITKK CKDPVAAFKF LDRMAGEDIL KLINWGIQGQ DYYVKNGKMY KDAKQIQNYM NPDYRKKQGI GGNIWFAFPR PPFDWTYSDK SGKISWDYSD QALEQRYKPY EKEVLKAYKI KSFKDLFSPT WNSPYGYGWD IKLPDDLQAI QNQADDLQRR YITKAIMAKP GDYDKIWNEY LSKMKNIPIK KVIDFRQKEI QRRLKEWN
|
| |