Gene GWCH70_1512 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_1512 
Symbol 
ID7976599 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp1584242 
End bp1586113 
Gene Length1872 bp 
Protein Length623 aa 
Translation table11 
GC content47% 
IMG OID644798409 
ProductPTS system, fructose subfamily, IIC subunit 
Protein accessionYP_002949582 
Protein GI239826958 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1299] Phosphotransferase system, fructose-specific IIC component 
TIGRFAM ID[TIGR00829] PTS system, fructose-specific, IIB component
[TIGR00848] PTS system, fructose subfamily, IIA component
[TIGR01427] PTS system, fructose subfamily, IIC component 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000350477 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAATTA CCGATTTGCT TACAAAGGAA ACAATTATTC TCCATCTGAA AGCGAAAACA 
AAAGAAGAAG TAATTGACGA ACTTGTCGCG AAACTGCAAG AAGCAGGAGT GTTGCGTGAC
GCACAAGCGT TCAAAGAAGC GATTTTTGCG CGTGAAGCAC AAAGCACGAC CGGTGTCGGT
GATGGAATTG CCATTCCTCA TGCGAAAACA GCCGCGGTAA AGCGGCCTGC CGTAGCGTTT
GGCCGTTCTG AGAGCGGCAT TGACTATGAC GCGCTTGATG GAAAACCGAG CCGCTTGTTT
TTTATGATCG CAGCGCCAGA AGGAGCGAAT AATACACATT TAGAAGCGCT TGCCCGCTTA
TCGTCTATGC TAATGGATTC CTCTTTCCGC GCGCGGATTG AAAGCGTTTC AAATGAAGAG
GAATTTATTC GGTTGATTGC AGAAAAAGAG GCAGAGGAAA CGAAAGAAGC AGAACATACG
GCATCTTCAC CTTCGAAGCG CAAAAAAGTC ATTGCTGTCA CTGCTTGTCC GACAGGAATC
GCTCATACGT ATATGGCGGC AGATGCTTTA AAAGCAAAAG CAGCGGAAAT GGATGTGGAT
ATAAAAGTAG AAACAAACGG TTCTAGCGGA GTGAAAAACG AATTAACCAA GCAAGATATT
GAGGAAGCGG TTGCGGTCAT CGTTGCGGCG GATAAGCAAG TGGAAATGGA GCGGTTCAAA
GGAAAACATG TGATTCAAGT ACCGGTTGCG CAAGCGATTC GTAAACCGAA AGAGCTAATC
GAACAAGCTC TCCGGCAAGA TGCGCCGATT TATCAAGGAA GCGGCGCGAA AGAGGCGACA
ACGGTAGGAA AGCCGCGTAC AGGATTTTAT AAACATTTAA TGAACGGAGT TTCCAACATG
CTTCCGTTCG TTGTCGGCGG CGGTATTTTA ATTGCGATTT CATTTATCTT CGGTATTAAA
GCGTTTGATC CAAAAGATCC GTCGTATCAT CCGATAGCGA AAGCGCTTAT GGATATCGGT
GGAGGAAACG CCTTTGCGTT GATGATTCCA GTGCTTGCTG GTTTTATTGC CATGAGTATT
GCGGACCGGC CTGGTTTTGC GCCAGGGATG GTAGGCGGTT TCATGGCGGC AAATGGCGGC
GCCGGCTTTT TAGGCGGGTT AATCGCAGGC TTTCTTGCCG GGTATTTAGT GGTTGGGTTG
AAAAAAGTAT TTAGCCATCT GCCACAGTCG CTTGAGGGAA TCAAACCAGT GTTGCTTTAT
CCGCTTTTTG GCATTTTCAT TACGGGACTT ATTATGATGT ATGTTGTTAT CGATCCAGTG
AAGGCGTTAA ATGAGGCAAT GAAGCATTGG CTTGAAAATA TGGGAACGGC AAACTTAATT
TTACTTGGCG CGATTCTTGG CGGCATGATG GCGGTTGATA TGGGCGGCCC GATCAACAAA
GCGGCATTTA CGTTTGGAAT TGCCATGATT GATGCTGGAA ATTATGCTCC GCACGCAGCG
ATCATGGCCG GAGGAATGGT GCCGCCGTTA GGGCTGGCTC TTGCGACGAC ATTCTTTAAA
AAGAAATTTA CAAAAGCGGA ACGTGAAGCT GGAAAAACAT GCTATATCAT GGGAGCGACG
TTCATTACAG AAGGGGCGAT TCCGTTTGCG GCAGCCGACC CGGTACGCGT CATTCCATCC
ATCATTGTCG GCTCTGCGGT CAGCGGAGCG CTGACGATGT TGTTTCACAT TGGCCTTCCA
GCTCCGCATG GTGGAATTTT CGTTATTCCA ATTGTAAAAG GAAGTGCTTT ATTATACGTA
TTAGCGATTT TGATCGGTTC GATCATTACT GCTTTGATGG TTGGCTTGTG GAAAAAAGAA
GTAGAGGAAT AA
 
Protein sequence
MKITDLLTKE TIILHLKAKT KEEVIDELVA KLQEAGVLRD AQAFKEAIFA REAQSTTGVG 
DGIAIPHAKT AAVKRPAVAF GRSESGIDYD ALDGKPSRLF FMIAAPEGAN NTHLEALARL
SSMLMDSSFR ARIESVSNEE EFIRLIAEKE AEETKEAEHT ASSPSKRKKV IAVTACPTGI
AHTYMAADAL KAKAAEMDVD IKVETNGSSG VKNELTKQDI EEAVAVIVAA DKQVEMERFK
GKHVIQVPVA QAIRKPKELI EQALRQDAPI YQGSGAKEAT TVGKPRTGFY KHLMNGVSNM
LPFVVGGGIL IAISFIFGIK AFDPKDPSYH PIAKALMDIG GGNAFALMIP VLAGFIAMSI
ADRPGFAPGM VGGFMAANGG AGFLGGLIAG FLAGYLVVGL KKVFSHLPQS LEGIKPVLLY
PLFGIFITGL IMMYVVIDPV KALNEAMKHW LENMGTANLI LLGAILGGMM AVDMGGPINK
AAFTFGIAMI DAGNYAPHAA IMAGGMVPPL GLALATTFFK KKFTKAEREA GKTCYIMGAT
FITEGAIPFA AADPVRVIPS IIVGSAVSGA LTMLFHIGLP APHGGIFVIP IVKGSALLYV
LAILIGSIIT ALMVGLWKKE VEE