Gene Information |
Locus tag | GWCH70_1620 |
Symbol | |
ID | 7976268 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 1693698 |
End bp | 1695113 |
Gene Length | 1416 bp |
Protein Length | 471 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 644798504 |
Product | PTS system, trehalose-specific IIBC subunit |
Protein accession | YP_002949676 |
Protein GI | 239827052 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1263] Phosphotransferase system IIC components, glucose/maltose/N-acetylglucosamine-specific |
TIGRFAM ID | [TIGR00826] PTS system, glucose-like IIB component; [TIGR00852] PTS system, maltose and glucose-specific subfamily, IIC component; [TIGR01992] PTS system, trehalose-specific IIBC component; [TIGR01996] PTS system, sucrose-specific IIBC component |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0000938635 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGGGCAT ATGAACAATC GGTTGCTAAA ATTGTCGAAG CAATTGGTGG AAAAGAAAAT ATTGTTGCCG CCACCCATTG TGTCACGCGT TTGCGTTTTG CGTTGAAAGA TGAGGGGAAA GTCGATAAAG AGAAATTAGA AAGCATTGAT ATTGTAAAAG GTTCATTTTC CGCGAACGGT CAATTTCAAG TAGTTATCGG ACAAGGGCTT GTCGATAAAG TATATAACGA AATGGTGGAA ATGACCGGCA TTGGAAGAGC GACAAAACAA GAGATTAAAG ATGCGGCAGA AGCGAAGTTA AATCCGCTGC AACGCGCCAT TAAAACATTA GCAGATATCT TTATCCCGAT TTTGCCGGCG ATTGTGACAG CTGGTTTGTT AATGGGGATT AACAACATCT TAACAGGTCC GGGTATTTTT TATGAAGGCA AATCGTTTGT CGAAGTGCAC AAAGAATGGG CAGATCTTGC TAGCATGATT AACCTTATTG CAAATACGGC GTTCGTCTTC TTGCCTGGCT TAATCGGATG GTCGGCAGTG ACAAAGTTTG GCGGAAGCCC GCTTTTGGGA ATTGTCCTCG GTTTGATGCT TGTCCATCCT GATTTGTTAA ATGCATGGGG ATGGGGAGCA GCGAAAGAAA AAGGGGAAAT TCCTATTTGG AATTTATTCG GATTTGAAGT GCAAAAAGTC GGATATCAAG GCCAAGTGCT GCCGGTGCTT GTAGCGTCTT ATGTACTTGC GAAAATCGAG CAATTTTTAC GTAAACGTAT ACCGGATGCA TTTCAATTGT TGCTTGTTGC ACCGCTTGCG TTATTAATTA CGGGCTTTTT AGCATTTATT GCAATTGGAC CGATTACGTT TGCGATCGGA AATGCGATTA CAAATGTATT TGTCAGCATT TTTGATAACG TTCCAGCGAT TGGCGGCTTT TTGTATGGGG CATTATACGC ACCGCTCGTT GTTACGGGAA TGCATCATAC GTTTTTACCG GTCGATTTGC AGTTGATTGC AAGCACAGGT GGTACGTTCT TATGGCCGAT CCTTGTCATG TCAAACGTTG CCCAAGGTTC TGCGGCATTA GCAATGATGT TTGCTGCAAA GGATGAAAAG TTAAAAGGTC TTTCTTTCAC TTCCGCAGTA TCTGCTTATC TTGGCATTAC CGAACCGGCG ATGTTTGGGG TAAACTTGCG TTTCCGTTAT CCGTTCATTT CGGCGATGAC GGGTGCGGCG ATTGCCGGAA TGTTTATTAC ACTAAATAAA GTCATCGCTC CATCGATTGG CGTTGGCGGT TTGCCAGGGT TTTTATCGAT CGTACCGCAA AAGTGGGCAC CATTCTTTAT CGGAATGGCA ATCGCCATTA TCGTACCGTT TGCCTTAACG TTTGTATTCA GCAAGTTCCG CAAAGAGAAT CGCTAA
|
Protein sequence | MGAYEQSVAK IVEAIGGKEN IVAATHCVTR LRFALKDEGK VDKEKLESID IVKGSFSANG QFQVVIGQGL VDKVYNEMVE MTGIGRATKQ EIKDAAEAKL NPLQRAIKTL ADIFIPILPA IVTAGLLMGI NNILTGPGIF YEGKSFVEVH KEWADLASMI NLIANTAFVF LPGLIGWSAV TKFGGSPLLG IVLGLMLVHP DLLNAWGWGA AKEKGEIPIW NLFGFEVQKV GYQGQVLPVL VASYVLAKIE QFLRKRIPDA FQLLLVAPLA LLITGFLAFI AIGPITFAIG NAITNVFVSI FDNVPAIGGF LYGALYAPLV VTGMHHTFLP VDLQLIASTG GTFLWPILVM SNVAQGSAAL AMMFAAKDEK LKGLSFTSAV SAYLGITEPA MFGVNLRFRY PFISAMTGAA IAGMFITLNK VIAPSIGVGG LPGFLSIVPQ KWAPFFIGMA IAIIVPFALT FVFSKFRKEN R
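The length fields in this record can be cross-checked against the coordinates with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The GC helper simply counts G and C over whatever sequence string it is given, ignoring the spaces between the 10-base blocks.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)
```

With the values from this record, `gene_length(1693698, 1695113)` gives 1416 and `protein_length(1416)` gives 471, matching the Gene Length and Protein Length fields. Passing the Gene sequence field to `gc_percent` would give the unrounded figure behind the GC content field.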