Gene GWCH70_0845 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_0845 
Symbol 
ID7979333 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp908978 
End bp910123 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content44% 
IMG OID644797818 
Productperiplasmic binding protein/LacI transcriptional regulator 
Protein accessionYP_002948991 
Protein GI239826367 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1879] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGCAAT TTAGACTTAA CTATTTAGTC GCATTTCTTT TTTCCATTAT CGTCATCCTT 
AGCGGCTGCA TGAATGAACA AGACGCCGTC AGTTCGAAAC CGGCCAAACA AGAAAACAGC
ACGGCCGAAG AAGCAAGCGA AACAACAGCA GCAACAACCG AACAAGCAGA TATCCCTGAA
GAATTAAAAA AACCAATAAA AATTGCCGCC ATTATGCAAA TGTCGATCGG CACGTTTTCT
TCTCAGTATA TCGCTGGCGT GAAAGAACAA GTGCAAAAAT TTGGCGGCGA AGTGCAAATT
TACAATGCGG ATAACGACTT AACGAAAATG GCTTCCTATG TCGAAACAGC CATTACGCAA
AACGTCGATG CGATTCTGTT AGATCACGGT CGCGCTGATG CGTTGGAAGG CCCGGTGAAA
AAGGCAGTCG AAAAAGGCAT TCCTGTTGTC GCATTCGATA ATGATTTAAA CATCCCAGGC
GTAACAGTGA TTGACCAAGA TGACTACAGC CTTGCATGGA AAACGTTAAA AACGTTGGCC
GAAGACTTAA ACGGCGAAGG AAACATCGTC ACCATTTGGG TTGGCGGATT TACACCGATG
GAACGGCGCC ATGTGATTTA CGATGCGTTT AAAAAACGCT ATCCGAACAT CAAAGAAGTG
GCGAAATTCG GTACGGCCAG CGCCAATACC GCTTTAGACA CACAAACGCA AATGGAAGCG
ATTTTGAAAA AATATCCGAA TAAAGGCGAT ATTGACGCAG TGTTTGCGAC TTGGGATGAA
TTTGCCAAAG GCGCAACACG TGCGATCGAG CAAGCAGGAC GCAATGAAAT TAAAGTATAT
GGCATCGATT TAAGCGATGA AGATTTGCAA ATGATGCAAA AGCCAAATAG CCCTTGGGTC
GCTACAACGG CAACGGATCC AGCCGAAGTC GGTCGCGTTC AAGTTCGCTT TGCGTATCAA
AAAATTGCTG GCGAGAAAAC ACCAAACATT TATTCACTAG AACCGCATTT AGTCAAACGA
TCCGATTTGC CTAACCAACA AGTTTCGATG AATGAGCTTC ACCAATACAT TTCCGGCTGG
GGGCAATCAA ATGTTGCCAT TTCCCCTCGG ATGAAAACAT TGGAAGCACA GGTGAAAAAC
AAATGA
 
Protein sequence
MKQFRLNYLV AFLFSIIVIL SGCMNEQDAV SSKPAKQENS TAEEASETTA ATTEQADIPE 
ELKKPIKIAA IMQMSIGTFS SQYIAGVKEQ VQKFGGEVQI YNADNDLTKM ASYVETAITQ
NVDAILLDHG RADALEGPVK KAVEKGIPVV AFDNDLNIPG VTVIDQDDYS LAWKTLKTLA
EDLNGEGNIV TIWVGGFTPM ERRHVIYDAF KKRYPNIKEV AKFGTASANT ALDTQTQMEA
ILKKYPNKGD IDAVFATWDE FAKGATRAIE QAGRNEIKVY GIDLSDEDLQ MMQKPNSPWV
ATTATDPAEV GRVQVRFAYQ KIAGEKTPNI YSLEPHLVKR SDLPNQQVSM NELHQYISGW
GQSNVAISPR MKTLEAQVKN K