Gene GWCH70_1620 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_1620 
Symbol 
ID7976268 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp1693698 
End bp1695113 
Gene Length1416 bp 
Protein Length471 aa 
Translation table11 
GC content44% 
IMG OID644798504 
ProductPTS system, trehalose-specific IIBC subunit 
Protein accessionYP_002949676 
Protein GI239827052 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1263] Phosphotransferase system IIC components, glucose/maltose/N-acetylglucosamine-specific 
TIGRFAM ID[TIGR00826] PTS system, glucose-like IIB component
[TIGR00852] PTS system, maltose and glucose-specific subfamily, IIC component
[TIGR01992] PTS system, trehalose-specific IIBC component
[TIGR01996] PTS system, sucrose-specific IIBC component 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000938635 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGGGCAT ATGAACAATC GGTTGCTAAA ATTGTCGAAG CAATTGGTGG AAAAGAAAAT 
ATTGTTGCCG CCACCCATTG TGTCACGCGT TTGCGTTTTG CGTTGAAAGA TGAGGGGAAA
GTCGATAAAG AGAAATTAGA AAGCATTGAT ATTGTAAAAG GTTCATTTTC CGCGAACGGT
CAATTTCAAG TAGTTATCGG ACAAGGGCTT GTCGATAAAG TATATAACGA AATGGTGGAA
ATGACCGGCA TTGGAAGAGC GACAAAACAA GAGATTAAAG ATGCGGCAGA AGCGAAGTTA
AATCCGCTGC AACGCGCCAT TAAAACATTA GCAGATATCT TTATCCCGAT TTTGCCGGCG
ATTGTGACAG CTGGTTTGTT AATGGGGATT AACAACATCT TAACAGGTCC GGGTATTTTT
TATGAAGGCA AATCGTTTGT CGAAGTGCAC AAAGAATGGG CAGATCTTGC TAGCATGATT
AACCTTATTG CAAATACGGC GTTCGTCTTC TTGCCTGGCT TAATCGGATG GTCGGCAGTG
ACAAAGTTTG GCGGAAGCCC GCTTTTGGGA ATTGTCCTCG GTTTGATGCT TGTCCATCCT
GATTTGTTAA ATGCATGGGG ATGGGGAGCA GCGAAAGAAA AAGGGGAAAT TCCTATTTGG
AATTTATTCG GATTTGAAGT GCAAAAAGTC GGATATCAAG GCCAAGTGCT GCCGGTGCTT
GTAGCGTCTT ATGTACTTGC GAAAATCGAG CAATTTTTAC GTAAACGTAT ACCGGATGCA
TTTCAATTGT TGCTTGTTGC ACCGCTTGCG TTATTAATTA CGGGCTTTTT AGCATTTATT
GCAATTGGAC CGATTACGTT TGCGATCGGA AATGCGATTA CAAATGTATT TGTCAGCATT
TTTGATAACG TTCCAGCGAT TGGCGGCTTT TTGTATGGGG CATTATACGC ACCGCTCGTT
GTTACGGGAA TGCATCATAC GTTTTTACCG GTCGATTTGC AGTTGATTGC AAGCACAGGT
GGTACGTTCT TATGGCCGAT CCTTGTCATG TCAAACGTTG CCCAAGGTTC TGCGGCATTA
GCAATGATGT TTGCTGCAAA GGATGAAAAG TTAAAAGGTC TTTCTTTCAC TTCCGCAGTA
TCTGCTTATC TTGGCATTAC CGAACCGGCG ATGTTTGGGG TAAACTTGCG TTTCCGTTAT
CCGTTCATTT CGGCGATGAC GGGTGCGGCG ATTGCCGGAA TGTTTATTAC ACTAAATAAA
GTCATCGCTC CATCGATTGG CGTTGGCGGT TTGCCAGGGT TTTTATCGAT CGTACCGCAA
AAGTGGGCAC CATTCTTTAT CGGAATGGCA ATCGCCATTA TCGTACCGTT TGCCTTAACG
TTTGTATTCA GCAAGTTCCG CAAAGAGAAT CGCTAA
 
Protein sequence
MGAYEQSVAK IVEAIGGKEN IVAATHCVTR LRFALKDEGK VDKEKLESID IVKGSFSANG 
QFQVVIGQGL VDKVYNEMVE MTGIGRATKQ EIKDAAEAKL NPLQRAIKTL ADIFIPILPA
IVTAGLLMGI NNILTGPGIF YEGKSFVEVH KEWADLASMI NLIANTAFVF LPGLIGWSAV
TKFGGSPLLG IVLGLMLVHP DLLNAWGWGA AKEKGEIPIW NLFGFEVQKV GYQGQVLPVL
VASYVLAKIE QFLRKRIPDA FQLLLVAPLA LLITGFLAFI AIGPITFAIG NAITNVFVSI
FDNVPAIGGF LYGALYAPLV VTGMHHTFLP VDLQLIASTG GTFLWPILVM SNVAQGSAAL
AMMFAAKDEK LKGLSFTSAV SAYLGITEPA MFGVNLRFRY PFISAMTGAA IAGMFITLNK
VIAPSIGVGG LPGFLSIVPQ KWAPFFIGMA IAIIVPFALT FVFSKFRKEN R