Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_2330 |
Symbol | |
ID | 7407749 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | + |
Start bp | 2471380 |
End bp | 2472981 |
Gene Length | 1602 bp |
Protein Length | 533 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 643716694 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002574173 |
Protein GI | 222530291 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATTGGT TCAAATCTTC TAAAAAGGTT TTAAGTATTA TCGTAGTTAT TGCATTTGCC TTATCGCTTG TAATTCCAGC ATTTATTTCA TCAACCTCAA TAGCTTATGC AAAAACAATA CCAACACTTA CCTATTTTGT TCGTCTTGAC CCCAAGGTTG CAACATCTTA CAACAGCTAC TCTTCAATTG CTGCTTACCA GCTTTTGCAG AAAAAACTTG GGGTAAAGAT TGTGTTCAAG CACCCACCGG TTGGCGGAGA GACAGACCAG TTCAACTTAA TGGTTGCATC AAGACAGCTG ACAGACATCA TTGAGTGGAA CTGGGTTGAT AACTATCCTG GTGGACCTGT AAAAGCAATG CTTGATAAGG TAATCATAAG GCTCAATGAC TATATGCCAA AATATGCTCC AAATTTATCA AAATACTTAC AGCAACATCC TGACATCAAA AAACTTATTG TTACAGACGA TGGTGACATT TACGGATTCC CAGCTCTTCG TGGAACAAAC CCAAAAATTG CATGCGTATA CTATGGACCT CAAATACGAA ATGATTGGTT GAAAAAGCTC GGATTAAAAG AACCAGAAAC AGTTGATGAC TGGTACAAGG TTTTGAAAGC ATTTGTAACA AAAGACCCAA ACGGCAATGG CAAAAAGGAT GAAAGAGGAT TTACAATTCT GCGAAATGCT TCAAACCCGA GATATGCATT TGATTATTCT TCCTTCTTAG TTGGTGCATG GGGAATAAAA ACAGATTTCT TCCAGATAAA TGGAAAGGTC AAATATGGTC CGTTAGAACC ACAATACAAA CAGTTTATAG CAACACTTCA GAAGTGGTGG AAAGAGGGCC TCATAGACCC GGATATCCTA ACAATGAACC AGAAGGTTAT AAGAGCAAAT GTTCAAAACG ATGTAATTGG TGCGTGGATA GGACTTCTTT CTGGCGATAT GGGCTTCTTC TTGAACCTGA AGAAAGATAT AATAGCTACC AAGTTCCCTG TGCTCAAGAA AGGTGAACAG CCACTTTTAG GACAGGCTGA GTTCTTGTTC GCTCGAACAA GCGCGGCTAT AACTACTGCA TGTAAAGACA TACCAACTGC TATGAAGGTG CTTGACTGGG GTTACAGCAA AGAAGGATAT GAAGCGTTCA ACTATGGTGT ACTTGGAAAG TCTTATATTA AGAAAGATGG CAAGGTTTAC TATACAGATG AAATCTTGAA AAACCCACAG GGACTTTCTG CAGCAGAAGC TTTAGCAAAA TATGCTCGTG CATCAATCAG CGGTCCTTTT GCTCAAGCTG ATGAGTATTA TCTACAGATT CAAATGATGT ATCCACAACA AAAAGAAGCT GTCATGCAAA AATGGTCTGA TGTCAAAAAT GACAGAATTT TGCCACCACT TTCGTTTACA GACGAAGAAT CCAAGAGACT TGCAAATATT ATGAATACAG TCAACACGTA CTATGATGAA ATGTTCTTAA GACTTATGAC TGGAAAAGCA ACAAATGTTG ATGCATTTGT AAAAACTCTT AAACAAATGA AGATTGATGA GGCTATTAAG ATTTATCAAG CTGCATATGA CAGATGGAAA AAGAGAAAAT AA
|
Protein sequence | MDWFKSSKKV LSIIVVIAFA LSLVIPAFIS STSIAYAKTI PTLTYFVRLD PKVATSYNSY SSIAAYQLLQ KKLGVKIVFK HPPVGGETDQ FNLMVASRQL TDIIEWNWVD NYPGGPVKAM LDKVIIRLND YMPKYAPNLS KYLQQHPDIK KLIVTDDGDI YGFPALRGTN PKIACVYYGP QIRNDWLKKL GLKEPETVDD WYKVLKAFVT KDPNGNGKKD ERGFTILRNA SNPRYAFDYS SFLVGAWGIK TDFFQINGKV KYGPLEPQYK QFIATLQKWW KEGLIDPDIL TMNQKVIRAN VQNDVIGAWI GLLSGDMGFF LNLKKDIIAT KFPVLKKGEQ PLLGQAEFLF ARTSAAITTA CKDIPTAMKV LDWGYSKEGY EAFNYGVLGK SYIKKDGKVY YTDEILKNPQ GLSAAEALAK YARASISGPF AQADEYYLQI QMMYPQQKEA VMQKWSDVKN DRILPPLSFT DEESKRLANI MNTVNTYYDE MFLRLMTGKA TNVDAFVKTL KQMKIDEAIK IYQAAYDRWK KRK
|
| |