Gene GWCH70_3408 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_3408 
Symbol 
ID7976187 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp3438933 
End bp3440759 
Gene Length1827 bp 
Protein Length608 aa 
Translation table11 
GC content39% 
IMG OID644800172 
ProductPAS/PAC sensor signal transduction histidine kinase 
Protein accessionYP_002951311 
Protein GI239828687 
COG category[T] Signal transduction mechanisms 
COG ID[COG5002] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAG TAGGCCCATT TCGTTCGATT CATTTCAAGT TTGCCGTTAT ATACGTTTTG 
CTTATTCTCA TAGCGATGCA AATTATCGGT GTTTATTTTG TTAGAAAGCT AGAAACGGAG
TTAGTGCAAA GTTTTAAAAA TTCCCTGAAT GAACGGGTGA CGCTATTAGC GTATAATGTG
GAACAAGCGA TGAATAAAGA AAGGGATTCG AAAAGTCCGA TGTTAGAAGA AGATATCCGC
TCCCTTCTCA ATGATTTTGT TTCCGATGAC ATTTCTGAAG TCCGTGTCAT CGATCATAAA
GGCAAAGTGT TAGCTACATC GAATCCATAT ATGCAAAATA TCGTTGGAAA GCGGACAACG
GAAATTTATG TGAAGCGATC GCTTGTGACT GGGGAAATCG TCGATCAAAT GTTCATAGAT
CAGAAAGGAC ACCGTATGTA TATTTCTTCT ACTCCAATTA AATCGAGTGG AGAAATAAAG
GGAGTTATTT ACATCATTGC TTCGATGGAG AACGTATTCT CGCAAATGAA ACAAATTAAT
ACAATTTTAG CAACAGGAAC CGGAATAGCT CTCTTTATTA CTGCGGTACT CGGGGTTCTT
TTGTCACGCA CGATTACTCG TCCGATTTCA GATATGCGAA AACAAGCGTT AGCGATGACA
AAGGGAGATT TCTCCCGCAA AGTAAAAATT TACGGTTATG ATGAAATCGG ACAATTAGCA
ATGACGTTTA ATAATTTAAC GAAAAAGCTG CAAGAAGCTC AAGCGACAAC AGAAGGGGAA
CGGCGTAAGC TTGAATCCGT ATTAACACAT ATGACAGATG GAGTGATTGC CACAGACCGC
CGCGGCAGAA TTATACTCAT TAATGATGCC GCCTTAAACA TTTTGAATGT TTCACGTGAA
ACAGTACTGT CGAGTTCCAT TATCGATGTA CTTGGAATTG GTGACCAATA TACGTTTGAA
ACACTGTTGG AGGAGCGGGA TTCGCTCATT TTAGATTTTA GTACGGATGA AGGGCTATAT
ATTTTGCGTG CATCGCTTTC TGTGATTCAA AAAGAATCAG GACTTATTAA CGGATTAATT
GTCGTATTGC ATGATATTAC GGAACAAGAA AAAATTGACC GCGAACGTAG AGAATTTGTT
GCCAATGTTT CACATGAGCT TCGCACTCCG TTAACGACGA TGAAAAGTTA TTTGGAAGCA
TTAGCGGAAG GAGCTTGGCA AGATAAAGCA ATCGCTCCTC GGTTTATCGA AGTAGTGCAA
ACAGAAACAG AACGAATGAT TCGCTTAGTC AATGACTTAT TACAGCTTTC TAAACTGGAC
AGTAAAGATT ATAAGCTAAA TAAATCATGG GTAAATTTTT CGGAGTACTT TCATAAAGTG
ATTGACCGTT TTGAATTGAC AAAAAGCGAA AACATTACGT TTGTGCGAAA AATCCCAAAA
GAAGCGATTT TTATTGAAAT CGATAAAGAT AAAATTACGC AAGTATTAGA TAATATTATT
TCCAACGCCA TCAAATATTC TCCACAAGGC GGAAAAATTA CGTTTCGCGT TAGAGAGCTT
GCCGATGAAA TTATTGTGAG TGTCAGCGAT GAAGGAGTCG GTATTCCTAA GGGGGATCTC
GCGAAAGTGT TCGAACGGTT TTATCGTGTC GATAAGGCAA GATCACGCAA ACTCGGCGGT
ACAGGGTTAG GATTAGCAAT TGCGAAAGAA GTGGTGATCG CTCATGGAGG AACCATTTGG
GCGGAAAGTA AGGAGCATAA AGGAACAACG ATTTTCTTTA CTTTACCGTT GGAAAGAGAT
CAAAAGGATG ACTGGGACGA TGTATGA
 
Protein sequence
MKKVGPFRSI HFKFAVIYVL LILIAMQIIG VYFVRKLETE LVQSFKNSLN ERVTLLAYNV 
EQAMNKERDS KSPMLEEDIR SLLNDFVSDD ISEVRVIDHK GKVLATSNPY MQNIVGKRTT
EIYVKRSLVT GEIVDQMFID QKGHRMYISS TPIKSSGEIK GVIYIIASME NVFSQMKQIN
TILATGTGIA LFITAVLGVL LSRTITRPIS DMRKQALAMT KGDFSRKVKI YGYDEIGQLA
MTFNNLTKKL QEAQATTEGE RRKLESVLTH MTDGVIATDR RGRIILINDA ALNILNVSRE
TVLSSSIIDV LGIGDQYTFE TLLEERDSLI LDFSTDEGLY ILRASLSVIQ KESGLINGLI
VVLHDITEQE KIDRERREFV ANVSHELRTP LTTMKSYLEA LAEGAWQDKA IAPRFIEVVQ
TETERMIRLV NDLLQLSKLD SKDYKLNKSW VNFSEYFHKV IDRFELTKSE NITFVRKIPK
EAIFIEIDKD KITQVLDNII SNAIKYSPQG GKITFRVREL ADEIIVSVSD EGVGIPKGDL
AKVFERFYRV DKARSRKLGG TGLGLAIAKE VVIAHGGTIW AESKEHKGTT IFFTLPLERD
QKDDWDDV