Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_2032 |
Symbol | |
ID | 7408245 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 2144650 |
End bp | 2146245 |
Gene Length | 1596 bp |
Protein Length | 531 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 643716399 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002573882 |
Protein GI | 222530000 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTGGTA AAAAGAATTT AAAAGTCTTG AGGTTCTTAA TTATTTTTGT AGTGTTTGTC CTTGTGGTTG GTTCGTTAGC AATAAATCCT AAAGAGAGCT TGGCAGCTAC AAAACCCACT ATCACATATT TTGTCCAGAT GGATGCAAAA GTTGCAGTTT CGTATGATAA CTATAGTAAG ATTGCGGCAT ATCAACTTCT TATGAAAAAG CTAAATGTGA ATATTCAATT TATCCATCCT CCAATGGGAG GTACTGCTGC CCAAGACCAG CTGAATTTAA TGATTGCATC CAAAAAACTT CCTGATATCA TTTATTGGAA TTGGGTGGAT AGTTATCCAG GCGGTCCTGT AAAAGCTTTA CAAGACAAGG TAATTATAAG ACTTAATGAA TATGTGGACA AATATGCTCC AAATTTTAAA TCATATTTGT CAAAACATCC TGATGTGAAA AAGATGATAG TGACAGACGA TGGTGATTTG TATTGTTTTC CCTATTTAAG AGAGGACCCT GAGATTCAAG GTACTTTTTA TGGACCAATT GTAAGAAAAG ATTGGTTGAA CAAATTAAAA ATTAATCCAC CTGAAACTGT GGATGAATGG TACAAGATGT TAAAAGCTTT TAAAGTTAAC GACCTCAATG GGAATGGTAA AAATGATGAA AGGCCGTTTT CTATTAGTTT AGGCGGTGCA ACAAGTCCAA GACGAGCTTT TGATTATTGC AGCTTTTTAG TAGGTGCGTG GGGTATCAAA ACTGATTTCT TTGTCGAGAA TAATAAGATT CAATACGGTC CTTTAAAGCC TCAATATACA GATTTTATAA AGACACTTCA AAAGTGGTGG AAGGAAGGGT TAATTGACCC AGATGTACTT ACTATGAACA GAGATATAAT CAGAGCAAAT ATTCAGAACG ATTTAATAGG TTCATTTTTA GGTTTAATTG GTGGAGACTT AGCTTTCTTT GTAAATCTAA AGAAAGATTT AATGGGTGTT AAGTATCCTG TTCTTAAGAA AGGGCAAAAA CCAGAGTTTA GCCATCGTGA ACCCCAGTTT GCAAAAAGTG GAGCTGCTAT TACCACGTCC TGTAAAAATA TTCCGCTTGC AATGAAGGTT CTTGATTGGG GATGGAGTAA GGAAGGATTT ATGGCACTCA ATTTTGGCGT GTTAGGAAAA AGTTATGTTA TAAAAGATGG ACGTCCGGTG TACACTGATG AGGTTATGAA CAACCCACAG CTTGATAGGC CTTCTGCACT TGCAAGATAT GCATGTGCTT CGTTTGGGGG GCCATTTATT CAGGCAAAAG AAAATGCTCT TCAGATAGGC TTAGGGCTCC CACAACAAAA AGAAGCAAGT GAGAACTGGA GATATGCATC TAACAAAAAA CTTTTGCCTA TTCTTTCATT TACATCTGAT GAAGCAAAGA AACTGGCAGA TATTATGAAT GTTATTAATA CTTATTATGA TGAGATGTTT GTAAGACTTA TGACAGGTAA ACTCAACGAT GTTGAACAGC TGAGAAAAGG ATTGAAAAGA ATGAGGATAG ACGAAGCAAT AAAGATATAT CAGCAAGCTT ACAACCGTTA CATCAACAGA AAGTAA
|
Protein sequence | MFGKKNLKVL RFLIIFVVFV LVVGSLAINP KESLAATKPT ITYFVQMDAK VAVSYDNYSK IAAYQLLMKK LNVNIQFIHP PMGGTAAQDQ LNLMIASKKL PDIIYWNWVD SYPGGPVKAL QDKVIIRLNE YVDKYAPNFK SYLSKHPDVK KMIVTDDGDL YCFPYLREDP EIQGTFYGPI VRKDWLNKLK INPPETVDEW YKMLKAFKVN DLNGNGKNDE RPFSISLGGA TSPRRAFDYC SFLVGAWGIK TDFFVENNKI QYGPLKPQYT DFIKTLQKWW KEGLIDPDVL TMNRDIIRAN IQNDLIGSFL GLIGGDLAFF VNLKKDLMGV KYPVLKKGQK PEFSHREPQF AKSGAAITTS CKNIPLAMKV LDWGWSKEGF MALNFGVLGK SYVIKDGRPV YTDEVMNNPQ LDRPSALARY ACASFGGPFI QAKENALQIG LGLPQQKEAS ENWRYASNKK LLPILSFTSD EAKKLADIMN VINTYYDEMF VRLMTGKLND VEQLRKGLKR MRIDEAIKIY QQAYNRYINR K
|
| |