Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_3155 |
Symbol | |
ID | 7977010 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 3181944 |
End bp | 3183248 |
Gene Length | 1305 bp |
Protein Length | 434 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 644799941 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002951080 |
Protein GI | 239828456 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAGAAAA AAGGATTATG GTTGTCCTTA GCATTAACGT TCAGCTTGGC GTTAGCTGGC TGTAATTCCG ATTCCAATTC CGCATCCAAC AATAACAATA AAGAGGATCA AGGAGGAAGC AAAGGACACG AAAAGCTTGA AATATTCAGC TGGTGGACTG GCGCCGGAGA AGAAGATGGA TTAAAAGCGT TAATTAAATT GTTTCAAGAA AAATATCCAG ATATACCAGT AGAAAATGCC GCAGTAGCAG GCGGTGCCGG AACCAATGCG AAAGCGGTGC TAGCGAGCCG CATGCAAGGA AACGACCCTC CTGCGACATT CCAAGTGCAC GGCGGCGCGG AGTTGAATGA AGGATGGGTA GCTGCCGGTA AAATGGAGCC GCTCAACGAT TTGTATGAAA AAGAAGGTTG GATGGATAAG TTTCCAAAAT CATTGATCGA CATGGTCAGC AAAGACGGGA AAATTTATTC CGTTCCTGTG AATATTCACC GCGGCAATGT GCTTTGGTAT AACAAAAAAA TCTTTGCCGA AAATGGCCTC CAGCCGCCAA AAACGTTTGA TGAATTTTTC CAAGTTGCCG AAAAGCTGAA AGCGAAAGGC ATTACTCCGC TTGCGTTGGG GGACAAAGAA CCTTGGACGG CAACGCATAT TTTTGAAACG GTTTTACTAG GCACGTTAGG AACAGAAGAT TACAAAAAGC TTTGGACAGG AGAATTATCG TTTGACGATC CAAAAGTAAA AGAAGCGGTC AACACGTTTA AGAAAATGTT GAATTACGTC AACGAAGACC ACAGCTCCCG CAACTGGCAA GATGCCGCTC AGTTGGTCGG AAAAGGCGAA GCGGCCATGA ACATTATGGG AGATTGGGTC AAAGGTTATT TTGTGAACGA TTTGAAATTA AAAGTCAATG AAGACTTCGG TTATGTGCCA ACTCCGAATA CAGAAGGCAA ATTTATGGTG ATCACCGATA CGTTTGGCTT GCCAAAAGGC GTGAAAAATC CAGAAGATGT GAAGAAATTT TTATCGGTGC TTGGTTCCGT GGAGGGACAA GATACGTTTA ACCCTCTGAA AGGCTCGATC CCAGCTCGTA TTGATGCGGA TCTGTCCAAA TATGACGAAT ATGGAAAACA AACAATTCAA GATTTTAAAA CAGCAGAACT TGCACCAAGC TTAGCGCATG GATCCGCAGC GCCGGAAGGA TTTGTGACAA AAGTCAACCA AGCAGTCAAT ATTTTCGTGA CGCAAAAAGA TGCGAAAACG TTTATTGATA CACTTGTATC GGCTTCGTCA GAATTGAAGA AATAA
|
Protein sequence | MKKKGLWLSL ALTFSLALAG CNSDSNSASN NNNKEDQGGS KGHEKLEIFS WWTGAGEEDG LKALIKLFQE KYPDIPVENA AVAGGAGTNA KAVLASRMQG NDPPATFQVH GGAELNEGWV AAGKMEPLND LYEKEGWMDK FPKSLIDMVS KDGKIYSVPV NIHRGNVLWY NKKIFAENGL QPPKTFDEFF QVAEKLKAKG ITPLALGDKE PWTATHIFET VLLGTLGTED YKKLWTGELS FDDPKVKEAV NTFKKMLNYV NEDHSSRNWQ DAAQLVGKGE AAMNIMGDWV KGYFVNDLKL KVNEDFGYVP TPNTEGKFMV ITDTFGLPKG VKNPEDVKKF LSVLGSVEGQ DTFNPLKGSI PARIDADLSK YDEYGKQTIQ DFKTAELAPS LAHGSAAPEG FVTKVNQAVN IFVTQKDAKT FIDTLVSASS ELKK
|
| |