Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_2378 |
Symbol | |
ID | 7407797 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 2529621 |
End bp | 2531081 |
Gene Length | 1461 bp |
Protein Length | 486 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 643716741 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002574220 |
Protein GI | 222530338 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 42 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTAATG AAGCTTCTCT ATTTTACAAA AGTTTTGCTG AAATACCATG TTATCAGTTG ATGGAAAAAT CTACAGGAGT AAAATTTGTT TTTAAACATC CACCTCTTGC TACATCTGCT GCACAAGACC AGTTCAATTT AATGATAGCT TCTCGACAGT TAACTGACAT AATAGAATGG GGATGGGATG GCTATCCTGG AGGTCCTGAA AAAGCCATCA TTGACAAGGT AATTGTGCCG TTGAATGATT ACATACCCAA GTATGCACCA AATTTAAAAA GACTTCTTGA TAAAAACCCT CAAATCAAAA GGATGGTAAG CTCAATTAGT GGAAAAATCT ATGGTTTCCC TGCTTTAAAA GAAACGCCAA TAGATGCATA TTACGGTCCT CAGGTTAGAA GGGATTGGCT TGAAAAACTT AAAATTGCTC CGCCAGAGAC AGTAGATGAA TGGTATAAAA TGTTGAAGGC GTTTAAAACC AGAGACCCGA ACGGAAATGG AAAAGCAGAC GAAAGACCTT TTTCAATGTT AAGAGGTGCT GCAAATCCGA GAGCTGTTTT TGACTATTGC AGCTTTTTAG TTGGGGCGTG GGGAATAAAA ACAGACTTCT TCCAAGTAAA TGGAAGAGTT AAATATGGAG CAATTGAACC TCAGTTTAAA GAGTTTATGA ATACTTTGGC AAAATGGTGG AAAGATGGGT TGATTGATCC AGATATACTG ACGATGAATC AACAAACAAT TCAAGCAAAT GTTTTGAGCG ACAAAATTGG AGCATATCTG GGGATAATTT CGGGGCATAT GGGTGCCTTT TTAGCAGCAA AGAAAGGGAC AGACTTTGAT TTAATAGGTG TGAAATATCC AGTACTGAAA AAAGGTGAAA TAGCACGAAT CGGTCAAAAT GAGTATCCTT TTACGGGAAG AGCAGCAGCA ATTACAACCA GCTGTAAGAA CATAGAGGCA GCATGCCGTG CGCTCGACTG GGCTTATAGC AAGGATGGGT ATATGGCTTT TAATTTTGGT GTAAAAGGAA AATCTTATAT GATTAAAAAC GGCCGACCAA TTTATACCGA TGAAATTCTT TACAACCCGC AAGGATTAGG GCCAAAACAG GCATTAGCTA AATATGCGCT GATTTATGGT CCATTTGTCC AATCCAGGGA GTATACATTA CAAATCAACT TGCAGTTGCC TCAACAAAAA GAAGCTTCAA AGAATTGGGG TATGGTTAAA AATGATATTG CATTAGGTCC AGTTTCGCTT TTCTTAACCC CAGAAGAGAC TAAAGAAATT GCAAATATTA TGAATACCAT AAATACTTAT TATGATGAGA TGTTTTTGAA GATGATGACA GGCAAGTATA ATAATTATGA TGCTTTTGTA AAAACTCTAA AGAAAATGAA GATAGAAGAA GCTATAAAGA TTTATCAGAA TGCTTATAAC AGATATATGC AAAGAAAATG A
|
Protein sequence | MSNEASLFYK SFAEIPCYQL MEKSTGVKFV FKHPPLATSA AQDQFNLMIA SRQLTDIIEW GWDGYPGGPE KAIIDKVIVP LNDYIPKYAP NLKRLLDKNP QIKRMVSSIS GKIYGFPALK ETPIDAYYGP QVRRDWLEKL KIAPPETVDE WYKMLKAFKT RDPNGNGKAD ERPFSMLRGA ANPRAVFDYC SFLVGAWGIK TDFFQVNGRV KYGAIEPQFK EFMNTLAKWW KDGLIDPDIL TMNQQTIQAN VLSDKIGAYL GIISGHMGAF LAAKKGTDFD LIGVKYPVLK KGEIARIGQN EYPFTGRAAA ITTSCKNIEA ACRALDWAYS KDGYMAFNFG VKGKSYMIKN GRPIYTDEIL YNPQGLGPKQ ALAKYALIYG PFVQSREYTL QINLQLPQQK EASKNWGMVK NDIALGPVSL FLTPEETKEI ANIMNTINTY YDEMFLKMMT GKYNNYDAFV KTLKKMKIEE AIKIYQNAYN RYMQRK
|
| |