Gene GWCH70_1463 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_1463 
Symbol 
ID7976909 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp1535639 
End bp1536973 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content40% 
IMG OID644798367 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002949540 
Protein GI239826916 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAGA AAGGTTTTGC GAAACTCATC GCCTTATTGC TGGCAGCCGT TCTTATTGTT 
ACTGGTTGTC AAGGGCAAAA TGGGAATCAA GAGAAAAACG CAAAAGATGA TGAAGCTGGC
AAAGTTGTTC AATTTGAGTT CTGGGCTGCG CCAAACCCTA CACAACAAGC TTTCTGGAAA
AAGATGGCCG AGGCTTATAT GAAAGAAAAT AAAAACGTTA AAATTAAGGT TACGCCAATG
CCGGAAAGCC CTACTTCTGA AGCAGGGATT CAATCAGCGA TTGCATCAGG GAAAGCGCCT
GCGGTCTCTG AAAACATTTC CCGCGGTTTT GCAGCTCAGC TAGCGGCGAG CCGTGCGATT
GTGCCATTGG ATGAATTTGA AGGTTTTGAT GAGTTAATTG ATAAACGTCA AATGAAGGAA
ACCATTTCAA GCTGGAAGTT TGCAGACAAC CATCAATATG TTTTGCCGAT TTATTCTAAT
GCCATGTTGT TTGGTTGGAG AATCGATATT CTAAAAGAAC TTGGATACGA TGCACCGCCA
AAAACATATA GTGAAGTCAT TGAAGTTGGC AAGAAATTGA AAGAAAAATA TCCTGATAAG
TTTCTTTGGG CGAGAGCTGA TTTAGTAAAA CCGACATGGT GGGCAAGATG GTTTGACTTC
TTTATGTTGT ACAATGCAGC ATCGAATGGA AATAACTTTA TTAAAGGAAA CAAATTTATT
GCGGATGACG AAGCCGGTGT AAAAACACTC CAATTCTTTA ATGATTTAAG CAAGAATAAA
TTGCTTTTGA CTCGAGAAGC GACTGACCCG TTTGAAACAG GTACATCCAT TATGGTGGAT
CTTGGGCCAT GGACATTCCC ATACTGGGCT GAGAAATTCC CGGAAATGAA GTTCAATGAA
ACTTATGTAT TATCGTTGCC GCCTGTGCCT GACGGCGTAG ATCCAGCAAA TTCTAAAACA
TTTGCAGATA CAAAAGGTCT TGTCATCTAT GCTTCTGCAA GCAAAGAACA ACAACAAGCA
GCTTTTGACT TTATTAAATG GGTATTTTCA GATGCGAAAA ATGATTTAGC TTGGTTCAAA
CAAACAAACT TGCCACCTGC ACGTGATGAT TTATCAACTA ATGAAGCTTT TGCATCCTAT
CTTGAAGAAA ACCCTCAATT AAAACAGTAT GCGGAAAACA TTCCAAACGC AATTCCACCT
GTGGATAACG AAAAAACGGT AGAAATTCAA GAATTGATTG GTAAAGAGGC CTTAAATCCT
GTTGTCAAAG GCCAAAAAGA TCCTGAAACA GCTTGGAAAG ATATGAAAAA GGCTGTTAAC
GGGGTGTTAA AGTAA
 
Protein sequence
MKKKGFAKLI ALLLAAVLIV TGCQGQNGNQ EKNAKDDEAG KVVQFEFWAA PNPTQQAFWK 
KMAEAYMKEN KNVKIKVTPM PESPTSEAGI QSAIASGKAP AVSENISRGF AAQLAASRAI
VPLDEFEGFD ELIDKRQMKE TISSWKFADN HQYVLPIYSN AMLFGWRIDI LKELGYDAPP
KTYSEVIEVG KKLKEKYPDK FLWARADLVK PTWWARWFDF FMLYNAASNG NNFIKGNKFI
ADDEAGVKTL QFFNDLSKNK LLLTREATDP FETGTSIMVD LGPWTFPYWA EKFPEMKFNE
TYVLSLPPVP DGVDPANSKT FADTKGLVIY ASASKEQQQA AFDFIKWVFS DAKNDLAWFK
QTNLPPARDD LSTNEAFASY LEENPQLKQY AENIPNAIPP VDNEKTVEIQ ELIGKEALNP
VVKGQKDPET AWKDMKKAVN GVLK