Gene GWCH70_0756 details


Gene Information

Locus tag: GWCH70_0756
Symbol:
ID: 7979307
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacillus sp. WCH70
Kingdom: Bacteria
Replicon accession: NC_012793
Strand:
Start bp: 831762
End bp: 833411
Gene Length: 1650 bp
Protein Length: 549 aa
Translation table: 11
GC content: 43%
IMG OID: 644797734
Product: extracellular solute-binding protein family 5
Protein accession: YP_002948908
Protein GI: 239826284
COG category: [E] Amino acid transport and metabolism
COG ID: [COG4166] ABC-type oligopeptide transport system, periplasmic component
TIGRFAM ID:
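
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive (the record does not state its convention) and that the unspliced CDS ends with a single stop codon. The variable names are illustrative.

```python
# Values copied from the Gene Information fields.
start_bp = 831762
end_bp = 833411

# Assumption: coordinates are 1-based and inclusive,
# so both endpoints count toward the length.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is unspliced and ends with exactly one stop codon,
# which is not counted as an amino acid.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values agree with the listed gene length (1650 bp) and protein length (549 aa).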


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAAGAAGA AACTATCATT GTTTCTTGTG CTTCTCCTGG CTGTAACCAC ATTTCTAGCG 
GCCTGTGGAG GCAACAACGA CACAGCAAAA GACAAAGGCG GCACCGCAAA TAAGCCGGCC
GAAAAGAAAG AGCAAGTGTT GAACTTGCTG GATTCTTCGG AAATTCCATC GCTTGATTCC
GCGCTTGCGA AAGACCAAGT ATCGTTCATC GTGTTGAACA ACGTGATGGA AGGCCTTTAC
CGTTTAGGCA AAGATAACAA GCCGGTTCCA GGTGTGGCGG AAAGCTATGA AGTAAGCGAA
GATGGCAAAA TGTACACGTT TAAACTTCGC AAAGACGCAA AATGGTCGAA CGGCGACCCT
GTAACGGCGC ATGACTTCGT ATTTGCGTGG AGAAAAGTAT TAGATCCAAA AACAGCTTCC
GAATACGCCT ACATTATGTA TGACATTAAA AACGCGGAAG AAGTCAACCA AGGCAAATTG
CCAGTTGACC AGCTTGGCGT AAAAGCGGTA GATGACTATA CGCTTCAAGT AGAATTGAAA
AAACCAATTC CATACTTCAT CAGCTTAACC GTATTTGGAT CGTTCATGCC GCAAAACGAA
AAATTCGTAA AAGAGCAAGG CGACAAATAT GGCTTGGAAG CGAACACGAC GCTTTACAAC
GGTCCATTCG TATTAAGCGA ATGGAAGCAT GAACAAGGCT GGACATATAA GAAAAATCCA
AACTATTGGG ATAAAGACAA TGTGAAGCTA GAAACAATCA ATGTCAAAAT CGTAAAAGAT
ACCGCAACAG CTGTAAACCT TTACGACACG AAAAAAGTGG ACCGAGTTGG TCTAACTGCA
GAGTTTGTTG ATAAATATAA AAACGACAAA AACTTCCATA CAGAGCTTGA TCCATCTATT
TTCTGGCTGC GCATGAACAC GAAAAACGAA TTGTTGAAAA ACGTCAACGC TCGTAAAGCA
ATCGCCATGG CGATTGACAA ACAAGCGCTT GTTGACACGC TTCTTAACAA CGGTACAATT
CCGGCAAACT ATATTGTTCC AAAAGACTTT GTTAAAGGTC CAAACGGAAA AGATTTCCGT
GACGAAAACG GCGATTTAGT GAAATACGAT GTAGAAGAAG CGAAAAAATT ATGGGAACAA
GCGAAAAAAG AGCTTGGCAA AGACAAATTT ACGATTGAGC TGTTGAACTT TGATTCCGAT
ACCGCGAAGA AAACTGGTGA ATACTTGAAA GAGCAGCTTG AAAAGAACTT GCCTGGTCTT
ACGGTCAACA TTAAACAACA ACCGTTCAAA CAAAAGCTTG AGTTAGAAAG CAACATGCAA
TATGACCTAT CCTTCTCTGG CTGGGGCCCA GACTATCAAG ACCCAATGAC ATTCCTCGAT
CTTTGGGTGA CAAACAACCC GCACAACCAA ACAGGCTGGT CCAACCCAGA GTACGACAAG
CTTGTTAAAG ATGCGAAAAC AACGTTGCTA AGCGACTTGC AAGCCCGCTG GGATGCAATG
CTAAAAGCAG AAAAACTCTT GTTTGAAGAA ATGCCAGTCG CACCGCTTTA TCAACGCGGC
TCTGCGTATT TGCAACGTGA ATACGTAAAA GGTATTGTTT CTCATCCATT TGGCGGAGAT
TATAGTTATA AATGGGCATA TATCGAGTAA
 
Protein sequence
MKKKLSLFLV LLLAVTTFLA ACGGNNDTAK DKGGTANKPA EKKEQVLNLL DSSEIPSLDS 
ALAKDQVSFI VLNNVMEGLY RLGKDNKPVP GVAESYEVSE DGKMYTFKLR KDAKWSNGDP
VTAHDFVFAW RKVLDPKTAS EYAYIMYDIK NAEEVNQGKL PVDQLGVKAV DDYTLQVELK
KPIPYFISLT VFGSFMPQNE KFVKEQGDKY GLEANTTLYN GPFVLSEWKH EQGWTYKKNP
NYWDKDNVKL ETINVKIVKD TATAVNLYDT KKVDRVGLTA EFVDKYKNDK NFHTELDPSI
FWLRMNTKNE LLKNVNARKA IAMAIDKQAL VDTLLNNGTI PANYIVPKDF VKGPNGKDFR
DENGDLVKYD VEEAKKLWEQ AKKELGKDKF TIELLNFDSD TAKKTGEYLK EQLEKNLPGL
TVNIKQQPFK QKLELESNMQ YDLSFSGWGP DYQDPMTFLD LWVTNNPHNQ TGWSNPEYDK
LVKDAKTTLL SDLQARWDAM LKAEKLLFEE MPVAPLYQRG SAYLQREYVK GIVSHPFGGD
YSYKWAYIE
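
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It is not the tool used to produce this record. It assumes the codon assignments of NCBI translation table 11, which match the standard code, and that table's alternative start codons (including GTG), which are read as methionine in the first position. The function name `translate_cds` and the two sample blocks are illustrative. The samples are the first and last lines of the gene sequence above.

```python
from itertools import product

# Codon table in TCAG order; table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        # An alternative start codon in the first position encodes methionine.
        if i == 0 and codon in START_CODONS:
            aa = "M"
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last 30 nt of the gene sequence, copied from the record.
first_block = "GTGAAGAAGA AACTATCATT GTTTCTTGTG"
last_block = "TATAGTTATA AATGGGCATA TATCGAGTAA"

print(translate_cds(first_block))
# The last block begins with TAT, which is not a start codon,
# so its first residue is translated normally.
print(translate_cds(last_block))
```

The two results correspond to the first ten and last nine residues of the protein sequence listed above. Passing the full gene sequence to the same function would follow the same steps.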