Gene Information |
Locus tag | GWCH70_0701 |
Symbol | |
ID | 7978879 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 774005 |
End bp | 775303 |
Gene Length | 1299 bp |
Protein Length | 432 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 644797685 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002948859 |
Protein GI | 239826235 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2182] Maltose-binding periplasmic proteins/domains |
TIGRFAM ID | |
| |
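The length fields in the record above follow from its coordinates. The sketch below checks that arithmetic. It assumes the Start bp and End bp values are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 774005
END_BP = 775303


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(length_bp: int) -> int:
    """Length in aa, assuming the final codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints "1299 432"
```

Both results agree with the Gene Length (1299 bp) and Protein Length (432 aa) fields.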
Plasmid Coverage information |
Num covering plasmid clones | 37 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAAAA AGAGAAGGGG TATCTCAATT TTGGCAGTGA TGGTGCTGGT ATTCAGTTTG CTGGCGGCTT GCGGACCAAC ATCATCCCAC AAAAGTGTCG GGAAGAATGA ACCGGCAAGT ACAAATGAAG GAGACAGTAA AGAAATTAAA CCAGAAAAAG GTGCTAAACT GAAAGTCTGG GAATCTGGCG GAGCAATTGG TGAATGGACC AAATATGTGG CTGAAGAATT TACCAAAAAG TATGGGGTGC CGGTAACTTT TGAAGAAGTA GGACATACTG ATGCGCCTGA AAAGCTCAAA ACGGATGGTC CGGCTGGACT TGCGGCAGAT GTGTTTGCTG CTCCACACGA TCATCTAGGA TCTATGGTAC CAGCAGGATT GGTTTTAGAA AACTTTTTCC CAGAAGAGTA TCAAGATAAG TTTATGAAAT CTGCTATTGA GGGGACGACA GTTGATGGAA CTCTTTACGG CTATCCTACG TCTATTGAAA CATATGCTTT ATTCTACAAC AAAGATCTGG TAAAGAAATT ACCTAAAACC ATGGATGAAC TGATTACACA AGCTAAAGAA TTGACTGATA TTAGAAACAA TAAATATGGA TTCATGATGG AAGTAGCGAA CTTGTATTTT GTATACTCAT TTATCGGCGG ATACGGTGGT TATGTATTCG GTGATAACAA TACCAATCCG AAAGACATTG GTTTGAATAA TGAAGGAGCT GTAAAAGCAG GAAAGTTAAT GCAACGTATT CATAAAGAAA TCTTGCCTTT GAAGGTGGAA GATATTACTT ATGATGTGAA ACAATCTTTA TTTAACGAAG GAAAACTAGC TTTCAATATT GATGGACCTT GGGCTGTAGC GGGCCATCGC GATGCTGGCG TGAACTTCGG TGTCATACCG TTGCCTAAAT TGGAGAACGG TCAAACACCT ACAAGTTTTT CTGGTATTCG CGCATTTTAT GTAAATGCTT ATACAAAGTA TCCGAACGCA GCCTCCTTAT TCGCCAAATT TGCAACGAGT GAAGAAATGC TACTAAAACG TTTTGAAATG ACTGGGCAAT TGCCACCGGT TCAATCTCTT CTAGACAATG AAACTATAAA AAATGATGAA ATTGCATCGG CATTCTTGGA GCAAGCAAAA TATGCAGTGC CAATGCCAAA CATTCCACAA ATGCCAATGG TCTGGGAACC TATGGCATCT GCTCTCACTA CGATTTGGAA TGATGGAAAA GATCCAAAAG AAGCTCTGGA TGCGGCGGTT GACCAGATTA AAGCGGGAAT TGCCACTCAG GGTCAGTAG
|
Protein sequence | MKKKRRGISI LAVMVLVFSL LAACGPTSSH KSVGKNEPAS TNEGDSKEIK PEKGAKLKVW ESGGAIGEWT KYVAEEFTKK YGVPVTFEEV GHTDAPEKLK TDGPAGLAAD VFAAPHDHLG SMVPAGLVLE NFFPEEYQDK FMKSAIEGTT VDGTLYGYPT SIETYALFYN KDLVKKLPKT MDELITQAKE LTDIRNNKYG FMMEVANLYF VYSFIGGYGG YVFGDNNTNP KDIGLNNEGA VKAGKLMQRI HKEILPLKVE DITYDVKQSL FNEGKLAFNI DGPWAVAGHR DAGVNFGVIP LPKLENGQTP TSFSGIRAFY VNAYTKYPNA ASLFAKFATS EEMLLKRFEM TGQLPPVQSL LDNETIKNDE IASAFLEQAK YAVPMPNIPQ MPMVWEPMAS ALTTIWNDGK DPKEALDAAV DQIKAGIATQ GQ
|
| |
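The sequences above are printed in blocks of 10 characters, so the spaces have to be removed before any analysis. The sketch below cleans the first three blocks of the gene sequence and translates them. It is a minimal illustration, not the database's own method. The helper names are hypothetical. For elongation codons, translation table 11 assigns the same amino acids as the standard code, so the standard codon table is used here.

```python
from itertools import product

# Codon table in the conventional T, C, A, G order ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block-of-10 spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate whole codons, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt of the gene sequence, copied from the record above.
prefix = clean("ATGAAGAAAA AGAGAAGGGG TATCTCAATT")
print(translate(prefix))  # prints "MKKKRRGISI"
```

The output matches the first block of the protein sequence. Passing the full gene sequence through `clean` and `translate` should reproduce the whole protein in the same way.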