Gene Information |
Locus tag | GWCH70_0756 |
Symbol | |
ID | 7979307 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 831762 |
End bp | 833411 |
Gene Length | 1650 bp |
Protein Length | 549 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 644797734 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002948908 |
Protein GI | 239826284 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4166] ABC-type oligopeptide transport system, periplasmic component |
TIGRFAM ID | |
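The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the reported protein length excludes the terminal stop codon.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon that is not translated."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(831762, 833411)
print(length)                     # 1650
print(protein_length_aa(length))  # 549
```

Both results agree with the Gene Length (1650 bp) and Protein Length (549 aa) rows.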
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | GTGAAGAAGA AACTATCATT GTTTCTTGTG CTTCTCCTGG CTGTAACCAC ATTTCTAGCG GCCTGTGGAG GCAACAACGA CACAGCAAAA GACAAAGGCG GCACCGCAAA TAAGCCGGCC GAAAAGAAAG AGCAAGTGTT GAACTTGCTG GATTCTTCGG AAATTCCATC GCTTGATTCC GCGCTTGCGA AAGACCAAGT ATCGTTCATC GTGTTGAACA ACGTGATGGA AGGCCTTTAC CGTTTAGGCA AAGATAACAA GCCGGTTCCA GGTGTGGCGG AAAGCTATGA AGTAAGCGAA GATGGCAAAA TGTACACGTT TAAACTTCGC AAAGACGCAA AATGGTCGAA CGGCGACCCT GTAACGGCGC ATGACTTCGT ATTTGCGTGG AGAAAAGTAT TAGATCCAAA AACAGCTTCC GAATACGCCT ACATTATGTA TGACATTAAA AACGCGGAAG AAGTCAACCA AGGCAAATTG CCAGTTGACC AGCTTGGCGT AAAAGCGGTA GATGACTATA CGCTTCAAGT AGAATTGAAA AAACCAATTC CATACTTCAT CAGCTTAACC GTATTTGGAT CGTTCATGCC GCAAAACGAA AAATTCGTAA AAGAGCAAGG CGACAAATAT GGCTTGGAAG CGAACACGAC GCTTTACAAC GGTCCATTCG TATTAAGCGA ATGGAAGCAT GAACAAGGCT GGACATATAA GAAAAATCCA AACTATTGGG ATAAAGACAA TGTGAAGCTA GAAACAATCA ATGTCAAAAT CGTAAAAGAT ACCGCAACAG CTGTAAACCT TTACGACACG AAAAAAGTGG ACCGAGTTGG TCTAACTGCA GAGTTTGTTG ATAAATATAA AAACGACAAA AACTTCCATA CAGAGCTTGA TCCATCTATT TTCTGGCTGC GCATGAACAC GAAAAACGAA TTGTTGAAAA ACGTCAACGC TCGTAAAGCA ATCGCCATGG CGATTGACAA ACAAGCGCTT GTTGACACGC TTCTTAACAA CGGTACAATT CCGGCAAACT ATATTGTTCC AAAAGACTTT GTTAAAGGTC CAAACGGAAA AGATTTCCGT GACGAAAACG GCGATTTAGT GAAATACGAT GTAGAAGAAG CGAAAAAATT ATGGGAACAA GCGAAAAAAG AGCTTGGCAA AGACAAATTT ACGATTGAGC TGTTGAACTT TGATTCCGAT ACCGCGAAGA AAACTGGTGA ATACTTGAAA GAGCAGCTTG AAAAGAACTT GCCTGGTCTT ACGGTCAACA TTAAACAACA ACCGTTCAAA CAAAAGCTTG AGTTAGAAAG CAACATGCAA TATGACCTAT CCTTCTCTGG CTGGGGCCCA GACTATCAAG ACCCAATGAC ATTCCTCGAT CTTTGGGTGA CAAACAACCC GCACAACCAA ACAGGCTGGT CCAACCCAGA GTACGACAAG CTTGTTAAAG ATGCGAAAAC AACGTTGCTA AGCGACTTGC AAGCCCGCTG GGATGCAATG CTAAAAGCAG AAAAACTCTT GTTTGAAGAA ATGCCAGTCG CACCGCTTTA TCAACGCGGC TCTGCGTATT TGCAACGTGA ATACGTAAAA GGTATTGTTT CTCATCCATT TGGCGGAGAT TATAGTTATA AATGGGCATA TATCGAGTAA
Protein sequence | MKKKLSLFLV LLLAVTTFLA ACGGNNDTAK DKGGTANKPA EKKEQVLNLL DSSEIPSLDS ALAKDQVSFI VLNNVMEGLY RLGKDNKPVP GVAESYEVSE DGKMYTFKLR KDAKWSNGDP VTAHDFVFAW RKVLDPKTAS EYAYIMYDIK NAEEVNQGKL PVDQLGVKAV DDYTLQVELK KPIPYFISLT VFGSFMPQNE KFVKEQGDKY GLEANTTLYN GPFVLSEWKH EQGWTYKKNP NYWDKDNVKL ETINVKIVKD TATAVNLYDT KKVDRVGLTA EFVDKYKNDK NFHTELDPSI FWLRMNTKNE LLKNVNARKA IAMAIDKQAL VDTLLNNGTI PANYIVPKDF VKGPNGKDFR DENGDLVKYD VEEAKKLWEQ AKKELGKDKF TIELLNFDSD TAKKTGEYLK EQLEKNLPGL TVNIKQQPFK QKLELESNMQ YDLSFSGWGP DYQDPMTFLD LWVTNNPHNQ TGWSNPEYDK LVKDAKTTLL SDLQARWDAM LKAEKLLFEE MPVAPLYQRG SAYLQREYVK GIVSHPFGGD YSYKWAYIE
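The relationship between the gene sequence and the protein sequence can be sketched in plain Python. This is a minimal illustration, not the database's own pipeline: the names `translate` and `gc_percent` are hypothetical. It assumes translation table 11 (as listed in the record), whose amino acid assignments match the standard code but which permits alternative start codons such as GTG. An initiating start codon is read as M, which is why the GTG at the beginning of this gene corresponds to the leading M of the protein.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; table 11 uses the same
# amino acid assignments as the standard genetic code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Start codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Drop the spaces used to group bases in tens and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First four 10-base groups of the gene sequence above.
print(translate("GTGAAGAAGA AACTATCATT GTTTCTTGTG CTTCTCCTGG"))  # MKKKLSLFLVLLL
```

Passing the full Gene sequence field to `translate` should reproduce the Protein sequence field once the grouping spaces are removed, and `gc_percent` on the same input can be compared against the GC content row.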