Gene Information |
Locus tag | GWCH70_1397 |
Symbol | |
ID | 7976711 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 1468547 |
End bp | 1469446 |
Gene Length | 900 bp |
Protein Length | 299 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 644798319 |
Product | periplasmic solute binding protein |
Protein accession | YP_002949492 |
Protein GI | 239826868 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0803] ABC-type metal ion transport system, periplasmic component/surface adhesin |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.682117 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAAAAAAT GGTGGATGAG TTTGGTTTGC GTATTTTTTT TATTTGGATG CAGCAATGAA AAAAGCGTGA GTGACAAAAT AAACATCACG GTTACCACTG GTCAAATTGC CGATATTGTT GAACACGTCG GCGGTGATCA CGTTCATGTC GAGGCGCTCA TGGGACCAGG AGTTGACCCG CATTTATACA AAGCATCGCA AGGAGATATC CAAAAATTAA GTTCGGCTGA TTTGATTTTT TATAACGGTC TTCATTTAGA AGGAAAAATG GGCGAAATTT TTGAGAAGAT GGAAAAAGAA AAGAAAGTTG TTGCCGTGGC TGAAGCGATT CCGAAAGAAA AGCTCATTCA AATGGACGGA ACGTACGATC CACATGTTTG GTTTGATCTT GATTTATGGT CATATGCAGT AAAAGCGGTG CGCGATGAAC TCATTGAATT CGATCCAACG CATAAAGAAG ATTATGAGAA AAATGCGAAA GCGTATATAG CGCAATTGAT GGAGTTAAAA CAAGAAGCAC AAAAAGAGAT CAGCTCGATT CCGAAACAAC AGCGCGTCAT GATCACGGCG CACGATGCGT TCCACTACTT TGGGCGCGCA TATGACATGG AAGTGATCGG CTTGCAAGGG TTGAGCACCG ATGCGGAATA TGGATTAAAG GACGTGCAAG AGCTCGTCAA TACGATTGTT GAACGGAACA TTAAAGCGGT ATTTGTGGAG AGCAGTGTAT CGAAAAAAGC GATTCAAGCA GTTGTCGAAG GAGCGAAACA GCGCGGTCAC GATGTGAAAA TTGGCGGTGA GCTCTTTTCT GATGCACTTG GTGAGAAAGG AACGGAAGAG GGAACGTTTG TTGGAATGTA CCGTCATAAC GTTAAAACGA TTGTAAAATC ACTGAAGTAG
Protein sequence | MKKWWMSLVC VFFLFGCSNE KSVSDKINIT VTTGQIADIV EHVGGDHVHV EALMGPGVDP HLYKASQGDI QKLSSADLIF YNGLHLEGKM GEIFEKMEKE KKVVAVAEAI PKEKLIQMDG TYDPHVWFDL DLWSYAVKAV RDELIEFDPT HKEDYEKNAK AYIAQLMELK QEAQKEISSI PKQQRVMITA HDAFHYFGRA YDMEVIGLQG LSTDAEYGLK DVQELVNTIV ERNIKAVFVE SSVSKKAIQA VVEGAKQRGH DVKIGGELFS DALGEKGTEE GTFVGMYRHN VKTIVKSLK
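The length fields in this record can be cross-checked against the coordinates and sequences. The following is a minimal Python sketch, not part of the source database: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. Both assumptions are consistent with the values shown (900 bp, 299 aa), but the record itself does not state them.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Number of codons minus one terminal stop codon (unspliced CDS assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def clean_sequence(raw: str) -> str:
    """Remove the whitespace that groups the sequence into blocks of ten."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


if __name__ == "__main__":
    # Coordinates taken from the Start bp and End bp fields of this record.
    length = gene_length(1468547, 1469446)
    print(length)                           # 900
    print(expected_protein_length(length))  # 299
```

Passing the full Gene sequence field to gc_percent gives a value that can be compared with the reported GC content; the grouping spaces are stripped by clean_sequence before counting.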