Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_1392 |
Symbol | |
ID | 7978189 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 1462257 |
End bp | 1463594 |
Gene Length | 1338 bp |
Protein Length | 445 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 644798315 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002949488 |
Protein GI | 239826864 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAA GCATTTTTAT TTTGTTCACG TTCATTTTTA TGATACTGGC TGCTTGTTCA AATCAAAGTG ATGAAGCAAC GGCCACTCCA TCTGAAAACA AAGACGGGAA AACAGAGGTC ATTTTTTGGC ATGCGATGAG TGGCGATTTA GAAACAGTGT TAAAAGAGAT TGTGGAGGAA TTTAACCAGT CCCATCCTTC TATTAAGGTG AAGCCGATTT TTCAAGGTAC ATACGAAGAG GCGTTGACGA AATTTAATAC GGTTGCTGGA ACGAAAGACA CGCCAACGAT TATGCAAACA TTTGAAGTTG GAACAAAGTA TATGATTGAT AGCGGCAAAA TTCAACCTGT CCAAAAATTT ATTGATGAAG AGAATTATGA CATATCGCAA TGGGAAAAAA ATATCGTGAA TTATTATACG GTGGATGGAA AAATCTACTC CATGCCGTTC AACTCCTCCA CGCCTGTGTT GATTTACAAC AAAGATGCGT TCAAAGAAGC AGGATTGGAC CCTGAAAAGC CGCCGATGAC GTATAGCGAG TTAAAAGAAG CGGCAAGAAA ATTGACAAAA AAACAAGGGG GACAAACGAC TCAATATGGC TTTTCCATTT TAAATTACGG CTGGTTTTTT GAAGAAATGC TCGCGGTGCA AGGCGGGCTG TACGTCAATA ATGAAAATGG CCGCAAGGGA AACGCTACGA AAGCGATTTT TAATAGTGAA GAAGGATTGC GTGTATTCGA TTTGATTCGT GACATGTATA AAGAAGGAAC GTTCTACAAC GTAGGACAAA ACTGGGATGA TATGCATGCG GCGTTTCAAT CAAAGAAAGT CGCGATGTAT TTAGACTCTT CTGCAGGAGT GAAAACGATT ATCGACAATG CGCCGTTTGA GGTAGGAGTT TCTTACTTGC CAGTTCCAGA CGGTGTCGAG CGTCAAGGAG TGATCATTGG TGGAGCGTCG CTATGGATGG CCAAAAATAT TAGTGAAAAG CAACAAAAAG CGGCATGGGA GTTCATGAAA TATTTAGCAA CACCTGAAGT TCAAGCGAAA TGGCATGTGA AAACAGGATA TTTTGCGATT AATCCAGCTG CTTATGACGA AGAGATCGTG AAAGCGGAGT GGGAAAAGTA CCCTCAATTA AAAGTAACCG TCGATCAATT AAAACAAACA AAACCAATAC CTGCGACACA AGGAGCGCTT ATATCTGTTT TTCCGGAATC TCGTCAAAAA GTAGTGAAAG CAATGGAAAG CTTATACCAA GGTGTTGATC CGAAAGAAGC GTTGGATCGT GCAGCAGCGG AAACGAACCG AGCTCTTGAA GTAGCGAATA AAAAATAA
|
Protein sequence | MKKSIFILFT FIFMILAACS NQSDEATATP SENKDGKTEV IFWHAMSGDL ETVLKEIVEE FNQSHPSIKV KPIFQGTYEE ALTKFNTVAG TKDTPTIMQT FEVGTKYMID SGKIQPVQKF IDEENYDISQ WEKNIVNYYT VDGKIYSMPF NSSTPVLIYN KDAFKEAGLD PEKPPMTYSE LKEAARKLTK KQGGQTTQYG FSILNYGWFF EEMLAVQGGL YVNNENGRKG NATKAIFNSE EGLRVFDLIR DMYKEGTFYN VGQNWDDMHA AFQSKKVAMY LDSSAGVKTI IDNAPFEVGV SYLPVPDGVE RQGVIIGGAS LWMAKNISEK QQKAAWEFMK YLATPEVQAK WHVKTGYFAI NPAAYDEEIV KAEWEKYPQL KVTVDQLKQT KPIPATQGAL ISVFPESRQK VVKAMESLYQ GVDPKEALDR AAAETNRALE VANKK
|
| |