Gene Information |
Locus tag | GWCH70_1463 |
Symbol | |
ID | 7976909 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 1535639 |
End bp | 1536973 |
Gene Length | 1335 bp |
Protein Length | 444 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 644798367 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002949540 |
Protein GI | 239826916 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
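The length fields above can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in Protein Length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1535639
end_bp = 1536973

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon, so the protein has one fewer
# residue than the CDS has codons.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 1335 0 444
```

Under these assumptions the result agrees with the listed Gene Length (1335 bp) and Protein Length (444 aa), and the zero remainder shows the CDS is a whole number of codons.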
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAGA AAGGTTTTGC GAAACTCATC GCCTTATTGC TGGCAGCCGT TCTTATTGTT ACTGGTTGTC AAGGGCAAAA TGGGAATCAA GAGAAAAACG CAAAAGATGA TGAAGCTGGC AAAGTTGTTC AATTTGAGTT CTGGGCTGCG CCAAACCCTA CACAACAAGC TTTCTGGAAA AAGATGGCCG AGGCTTATAT GAAAGAAAAT AAAAACGTTA AAATTAAGGT TACGCCAATG CCGGAAAGCC CTACTTCTGA AGCAGGGATT CAATCAGCGA TTGCATCAGG GAAAGCGCCT GCGGTCTCTG AAAACATTTC CCGCGGTTTT GCAGCTCAGC TAGCGGCGAG CCGTGCGATT GTGCCATTGG ATGAATTTGA AGGTTTTGAT GAGTTAATTG ATAAACGTCA AATGAAGGAA ACCATTTCAA GCTGGAAGTT TGCAGACAAC CATCAATATG TTTTGCCGAT TTATTCTAAT GCCATGTTGT TTGGTTGGAG AATCGATATT CTAAAAGAAC TTGGATACGA TGCACCGCCA AAAACATATA GTGAAGTCAT TGAAGTTGGC AAGAAATTGA AAGAAAAATA TCCTGATAAG TTTCTTTGGG CGAGAGCTGA TTTAGTAAAA CCGACATGGT GGGCAAGATG GTTTGACTTC TTTATGTTGT ACAATGCAGC ATCGAATGGA AATAACTTTA TTAAAGGAAA CAAATTTATT GCGGATGACG AAGCCGGTGT AAAAACACTC CAATTCTTTA ATGATTTAAG CAAGAATAAA TTGCTTTTGA CTCGAGAAGC GACTGACCCG TTTGAAACAG GTACATCCAT TATGGTGGAT CTTGGGCCAT GGACATTCCC ATACTGGGCT GAGAAATTCC CGGAAATGAA GTTCAATGAA ACTTATGTAT TATCGTTGCC GCCTGTGCCT GACGGCGTAG ATCCAGCAAA TTCTAAAACA TTTGCAGATA CAAAAGGTCT TGTCATCTAT GCTTCTGCAA GCAAAGAACA ACAACAAGCA GCTTTTGACT TTATTAAATG GGTATTTTCA GATGCGAAAA ATGATTTAGC TTGGTTCAAA CAAACAAACT TGCCACCTGC ACGTGATGAT TTATCAACTA ATGAAGCTTT TGCATCCTAT CTTGAAGAAA ACCCTCAATT AAAACAGTAT GCGGAAAACA TTCCAAACGC AATTCCACCT GTGGATAACG AAAAAACGGT AGAAATTCAA GAATTGATTG GTAAAGAGGC CTTAAATCCT GTTGTCAAAG GCCAAAAAGA TCCTGAAACA GCTTGGAAAG ATATGAAAAA GGCTGTTAAC GGGGTGTTAA AGTAA
|
Protein sequence | MKKKGFAKLI ALLLAAVLIV TGCQGQNGNQ EKNAKDDEAG KVVQFEFWAA PNPTQQAFWK KMAEAYMKEN KNVKIKVTPM PESPTSEAGI QSAIASGKAP AVSENISRGF AAQLAASRAI VPLDEFEGFD ELIDKRQMKE TISSWKFADN HQYVLPIYSN AMLFGWRIDI LKELGYDAPP KTYSEVIEVG KKLKEKYPDK FLWARADLVK PTWWARWFDF FMLYNAASNG NNFIKGNKFI ADDEAGVKTL QFFNDLSKNK LLLTREATDP FETGTSIMVD LGPWTFPYWA EKFPEMKFNE TYVLSLPPVP DGVDPANSKT FADTKGLVIY ASASKEQQQA AFDFIKWVFS DAKNDLAWFK QTNLPPARDD LSTNEAFASY LEENPQLKQY AENIPNAIPP VDNEKTVEIQ ELIGKEALNP VVKGQKDPET AWKDMKKAVN GVLK
|
| |
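The two sequence fields are related through translation table 11, whose codon-to-amino-acid assignments match the standard genetic code (it differs in which codons may act as start codons). The following is a minimal sketch of how the space-separated blocks could be cleaned, translated, and scored for GC content. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and the translation simply stops at the first stop codon.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G). Table 11 uses
# the same assignments for elongation; only the set of start codons differs.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-character blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the Gene sequence field above.
prefix = "ATGAAAAAGA AAGGTTTTGC GAAACTCATC"
print(translate(prefix))  # prints: MKKKGFAKLI
```

The printed residues match the first block of the Protein sequence field. Passing the full Gene sequence string to `gc_fraction` and `translate` would allow the GC content and Protein sequence fields to be checked the same way. Note that this sketch does not handle alternative start codons such as GTG or TTG, which table 11 permits; this gene begins with ATG, so that case does not arise here.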