Gene GWCH70_1392 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_1392 
Symbol 
ID7978189 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp1462257 
End bp1463594 
Gene Length1338 bp 
Protein Length445 aa 
Translation table11 
GC content41% 
IMG OID644798315 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002949488 
Protein GI239826864 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAA GCATTTTTAT TTTGTTCACG TTCATTTTTA TGATACTGGC TGCTTGTTCA 
AATCAAAGTG ATGAAGCAAC GGCCACTCCA TCTGAAAACA AAGACGGGAA AACAGAGGTC
ATTTTTTGGC ATGCGATGAG TGGCGATTTA GAAACAGTGT TAAAAGAGAT TGTGGAGGAA
TTTAACCAGT CCCATCCTTC TATTAAGGTG AAGCCGATTT TTCAAGGTAC ATACGAAGAG
GCGTTGACGA AATTTAATAC GGTTGCTGGA ACGAAAGACA CGCCAACGAT TATGCAAACA
TTTGAAGTTG GAACAAAGTA TATGATTGAT AGCGGCAAAA TTCAACCTGT CCAAAAATTT
ATTGATGAAG AGAATTATGA CATATCGCAA TGGGAAAAAA ATATCGTGAA TTATTATACG
GTGGATGGAA AAATCTACTC CATGCCGTTC AACTCCTCCA CGCCTGTGTT GATTTACAAC
AAAGATGCGT TCAAAGAAGC AGGATTGGAC CCTGAAAAGC CGCCGATGAC GTATAGCGAG
TTAAAAGAAG CGGCAAGAAA ATTGACAAAA AAACAAGGGG GACAAACGAC TCAATATGGC
TTTTCCATTT TAAATTACGG CTGGTTTTTT GAAGAAATGC TCGCGGTGCA AGGCGGGCTG
TACGTCAATA ATGAAAATGG CCGCAAGGGA AACGCTACGA AAGCGATTTT TAATAGTGAA
GAAGGATTGC GTGTATTCGA TTTGATTCGT GACATGTATA AAGAAGGAAC GTTCTACAAC
GTAGGACAAA ACTGGGATGA TATGCATGCG GCGTTTCAAT CAAAGAAAGT CGCGATGTAT
TTAGACTCTT CTGCAGGAGT GAAAACGATT ATCGACAATG CGCCGTTTGA GGTAGGAGTT
TCTTACTTGC CAGTTCCAGA CGGTGTCGAG CGTCAAGGAG TGATCATTGG TGGAGCGTCG
CTATGGATGG CCAAAAATAT TAGTGAAAAG CAACAAAAAG CGGCATGGGA GTTCATGAAA
TATTTAGCAA CACCTGAAGT TCAAGCGAAA TGGCATGTGA AAACAGGATA TTTTGCGATT
AATCCAGCTG CTTATGACGA AGAGATCGTG AAAGCGGAGT GGGAAAAGTA CCCTCAATTA
AAAGTAACCG TCGATCAATT AAAACAAACA AAACCAATAC CTGCGACACA AGGAGCGCTT
ATATCTGTTT TTCCGGAATC TCGTCAAAAA GTAGTGAAAG CAATGGAAAG CTTATACCAA
GGTGTTGATC CGAAAGAAGC GTTGGATCGT GCAGCAGCGG AAACGAACCG AGCTCTTGAA
GTAGCGAATA AAAAATAA
 
Protein sequence
MKKSIFILFT FIFMILAACS NQSDEATATP SENKDGKTEV IFWHAMSGDL ETVLKEIVEE 
FNQSHPSIKV KPIFQGTYEE ALTKFNTVAG TKDTPTIMQT FEVGTKYMID SGKIQPVQKF
IDEENYDISQ WEKNIVNYYT VDGKIYSMPF NSSTPVLIYN KDAFKEAGLD PEKPPMTYSE
LKEAARKLTK KQGGQTTQYG FSILNYGWFF EEMLAVQGGL YVNNENGRKG NATKAIFNSE
EGLRVFDLIR DMYKEGTFYN VGQNWDDMHA AFQSKKVAMY LDSSAGVKTI IDNAPFEVGV
SYLPVPDGVE RQGVIIGGAS LWMAKNISEK QQKAAWEFMK YLATPEVQAK WHVKTGYFAI
NPAAYDEEIV KAEWEKYPQL KVTVDQLKQT KPIPATQGAL ISVFPESRQK VVKAMESLYQ
GVDPKEALDR AAAETNRALE VANKK