Gene GWCH70_2033 details

Gene Information

Locus tag: GWCH70_2033
Symbol:
ID: 7978984
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacillus sp. WCH70
Kingdom: Bacteria
Replicon accession: NC_012793
Strand:
Start bp: 2093924
End bp: 2097229
Gene Length: 3306 bp
Protein Length: 1101 aa
Translation table: 11
GC content: 36%
IMG OID: 644798855
Product: helicase domain protein
Protein accession: YP_002950025
Protein GI: 239827401
COG category: [R] General function prediction only
COG ID: [COG1205] Distinct helicase family with a unique C-terminal domain including a metal-binding cysteine cluster
TIGRFAM ID:
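The length fields above are consistent with the coordinates. As a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon (the variable names are illustrative, not part of the record), the arithmetic can be checked like this:

```python
# Coordinates copied from the record above.
start_bp = 2093924
end_bp = 2097229

# Assumption: both coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the results match the Gene Length and Protein Length fields of the record.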


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGCCAG GAGCAAAGAA GTTACTAGAA TACTTACAGG AAAATAAAGA ATTAGATTTT 
AAAGATGCTA AAAATATAGG AAGTTTTTGG ACTGATAAAA CCATTAAGCG CTTTGTGGAA
GAGTACAATG ATATTTTTGT TGTTGAAAAC CAAAAAGTAA GACTTAAGAA GGGGGAAAAC
TTACAGTCTT ATAAGACTCA TATAGATGTG AATGAATATG TTCAAATGCG TTCAACGATT
CTTTCAGAAT TAGAGAAATT TCTCGTTGGT CCATTTGAAG AAAATGAAAC TCTCGGTCGT
AGAAAAGCAC CAATGGCCCT TTATCTAACT GGGAAGCTTG TTCCTTTCGG TTCCACTTTT
GATGTTATTA ATGAAGAAGA AAATCATATT GAAACAAAAC AATTATTAGA AGATGAAGCG
ATGGATGAAA TGCTTATTCA TCGCCATGTA TTTCGTCCAT CTGCGATGGG ATTTAGTTTT
AAAATGAAAT CCCTTTCGAG TATAAAAGTT CACATTAGTT GGGGAATGTA CGATGATGAA
GAACATAAAC GGACGCAACT ACAAGAAGAA TGGTGTTTTG TTCCTGAAAA TAAAACATAT
GTGGCTAAAA ATGAGCCTGC TCGAGTACGT TGTAAAATTA AATATCATGA CGGTTTATAT
CATATTAGTT TATTTTTGAT TAACAGCTAT AAACGGGATT CATATCCAAA ACAAAGTGAA
ATTATGTTTC AAACAAAAAT GATTGTGGAG GTTCCTAAGG AACATATAGC TGTTTTTTCT
TCCAAAGCTG ATATAAATCA TTACGAAGAT GAGCTATTAT ACGGACGGCA TTTTCATGAG
TATGCAGTTG GTCATGGGGT TGGCGTAGAT TGGAAAGAAA CGGATCAGTA CGTCATTATT
GAGAGCAAAT GGCTTCCTTT TTATGAATTG CCTGTTGTAG AGCATCGTAC TTTCTCTCAT
GCTCGATTTT TTATGAAAGA GTTAAGTGAA ATGGATTCGG AAAGTTTACA TACAACATTA
TCTATTATTC CTGAGCAATA TGAAAAATGG CTATGTGAAC AAAAAGGTCA TATTCCATCT
TTGCCGGAAC ATTTACAAGA TATAGCGAGA AATAATGTTA ACAAGATTGA AAAAATAATT
AAAAGAATTC GAGAAGGTAT TTGTTTAATT ATAAGCAATC CTTCTGTAAA AGAAGCATTC
CAATTTGCAA ATAAAGTTAT GATGATTCAG CAAGCTCAAT CGAAGGTTGC TTTGCATTAT
CGTACTTTTC AAGAGAGAAT AGAACCAAAA TATACATCGG AATGGCGTTT ATTCCAAATA
GCATTTTTAT TAATGAATAT CGCTGGTATT GTTGATAGAC ATCACGAAGA TAGAGATGTT
GTAGATTTAA TCTGGTTTCC GACAGGTGGG GGGAAAACAG AGGCGTATCT AGGATTAGCT
GCATTTACCA TGGGATATCG CCGGCTTATT GGGGAGTGGG ATAATCCGGA AACATATGCA
GGTGTTACTG TATTTATGCG TTATACATTG CGGTTGCTAA CAACACAGCA ATTTCAAAGA
GCCGCAGCTA TGATTTGCGC TGCTGAGTTG ATTCGTCAGG AAAATCCTGA AAAATATGGA
ATTGAACCAT TTCGTATAGG ACTTTGGATA GGACAATCTT CTTCACCTAA TACATATGAA
GATGCAATCC TAAAAATGGA GCAAATTAGG GAAGGAAACG AAGTACTAGA AGGAAATCCG
ATGCAGTTAA CTCACTGTCC TTGGTGCGGA ACAGAACTTA ATGCGGAGGA TTACATCATT
GAGCGGCATA AGCAATTAAT TCGTTGTCAT TATTATGATT GTCCTTTTTC ATCTGAAAAG
GGAATTCCTG CTTTAACGAT TGATGAGGCT ATTTATCAAT ATGTTCCTAC GATTCTTATA
GGAACTGTTG ATAAAATGGC GCAGATTGCT TGGAAAAAAG ATATGTATGA GTTATTTGGT
CGTAAAACTC ATTATGATTT AGAAAAGGGG TTTATTTTTT CTGAAACGAA TAAAAAAGGA
TATAAGAAAA TTAATTATTT AAAGCCTCCT GAATTGATTA TTCAAGATGA ACTGCATTTA
ATCTCTGGTC CATTAGGATC TTTGACGGGT CTATACGAAT TAGCTGTAGA TTATTTATGT
CAATATGATG GAGCAGGTCC TAAAATTGTC GCATCTACGG CCACTATTAG AGGAGCTGAT
GAACAGATTC GTCGTCTCTA CGGCCGTGAG GCTAGTCAAT TTCCTTTACC AGTCCTAAAA
GCAACAGATA ATTTTGTATC GTATGAAGTT CCAACACAAC AAAAGCCAGG AAGGTTATAT
GTAGGAATTT GTGCACCGGG TGTCAGTGGA AAAATTCATT CTGTCCATGT TTATTCAGCA
TTGTTGACGA TCAGTGAAAA ATTAAAAGGA CCTGTAATTG ATCCGTATTG GACAATTTTA
GGGTATTTTA ATACGATAAA AGAATTAGCA GGAACAACAA TGCTTTTTAA AGATGAAATC
CCAGTTCGTT TAAAATTACT TAATGAGGAT TCCGAGCAAA AGGAATTAAT TATTGAGGAA
ATGACGAGTC GAAGAAAAGC AAGGGAAATT CCTCATTTGC TGGCTCAAAT GGAAAAAACG
TATGCAGAAA ACGGAGCTCT TAACGCTGTA TTAGCTACAA ATATGATTTC GGTAGGAGTG
GATGTCAATC GTCTTGGAAT TATGGTTGTG CATGGTCAGC CAAAAACGAC ATCAGAATAT
ATTCAAGCAA CTAGCCGTGT TGGAAGAACA TATCCAGGTC TTGTTTTAAC CTTATTCAAC
TCTTTACGTC CACGTGATTT ATCACACTAT GAAAGATTTA AATCCTACCA TAGTTCAATT
TACCGTTTTG TTGAACCTAC AAGTGTGACA CCATTCGCTC GAGGTAGTAT TCAACGTGGA
TTAACCGGCC TGGTAGTAGG ATCAATGCGG CAAGGAATCA TAGAGATTAG CAAAGAACAA
AGTGCAAAAC GTTTTGTGAT AAACGAAGAC GTCGAAAAGA TTAAGAAATT TTTAATTGAG
AGAGCCGTAA AAACAGGAGA AATATCTGAG CAAGAACTTG AACAACATAT TGAGAGCGTT
TTAGATTGGT GGCTTGGAAT GACAAATAAA TATGATTCCC TTGCCTATCG AGCTTCAAAA
TATAATCGCA TGCCATATTT ATTGAAAGCT TTTGGTGATA GCAATGCATT GAAAGATGCA
AGACCTGCGA TGCATTCTCT CAGAAGTGTG GAAGCAGAGA TTGAAGTAAA AGCGTGGAAA
GGATAA
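
The GC content reported in the gene information can be recomputed from the sequence. The following is a rough sketch, not the database's own method: the function name `gc_fraction` is hypothetical, and it assumes the sequence is pasted as text in 10-base groups separated by spaces and line breaks, as shown above.

```python
def gc_fraction(sequence_text: str) -> float:
    """Return the fraction of G and C bases in a whitespace-grouped sequence."""
    # Remove the spaces and line breaks used for display grouping.
    bases = "".join(sequence_text.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# Example on the first display line only; paste the full gene sequence
# to compare against the reported GC content.
first_line = "ATGAAGCCAG GAGCAAAGAA GTTACTAGAA TACTTACAGG AAAATAAAGA ATTAGATTTT"
print(f"GC fraction of first line: {gc_fraction(first_line):.2f}")
```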
 
Protein sequence
MKPGAKKLLE YLQENKELDF KDAKNIGSFW TDKTIKRFVE EYNDIFVVEN QKVRLKKGEN 
LQSYKTHIDV NEYVQMRSTI LSELEKFLVG PFEENETLGR RKAPMALYLT GKLVPFGSTF
DVINEEENHI ETKQLLEDEA MDEMLIHRHV FRPSAMGFSF KMKSLSSIKV HISWGMYDDE
EHKRTQLQEE WCFVPENKTY VAKNEPARVR CKIKYHDGLY HISLFLINSY KRDSYPKQSE
IMFQTKMIVE VPKEHIAVFS SKADINHYED ELLYGRHFHE YAVGHGVGVD WKETDQYVII
ESKWLPFYEL PVVEHRTFSH ARFFMKELSE MDSESLHTTL SIIPEQYEKW LCEQKGHIPS
LPEHLQDIAR NNVNKIEKII KRIREGICLI ISNPSVKEAF QFANKVMMIQ QAQSKVALHY
RTFQERIEPK YTSEWRLFQI AFLLMNIAGI VDRHHEDRDV VDLIWFPTGG GKTEAYLGLA
AFTMGYRRLI GEWDNPETYA GVTVFMRYTL RLLTTQQFQR AAAMICAAEL IRQENPEKYG
IEPFRIGLWI GQSSSPNTYE DAILKMEQIR EGNEVLEGNP MQLTHCPWCG TELNAEDYII
ERHKQLIRCH YYDCPFSSEK GIPALTIDEA IYQYVPTILI GTVDKMAQIA WKKDMYELFG
RKTHYDLEKG FIFSETNKKG YKKINYLKPP ELIIQDELHL ISGPLGSLTG LYELAVDYLC
QYDGAGPKIV ASTATIRGAD EQIRRLYGRE ASQFPLPVLK ATDNFVSYEV PTQQKPGRLY
VGICAPGVSG KIHSVHVYSA LLTISEKLKG PVIDPYWTIL GYFNTIKELA GTTMLFKDEI
PVRLKLLNED SEQKELIIEE MTSRRKAREI PHLLAQMEKT YAENGALNAV LATNMISVGV
DVNRLGIMVV HGQPKTTSEY IQATSRVGRT YPGLVLTLFN SLRPRDLSHY ERFKSYHSSI
YRFVEPTSVT PFARGSIQRG LTGLVVGSMR QGIIEISKEQ SAKRFVINED VEKIKKFLIE
RAVKTGEISE QELEQHIESV LDWWLGMTNK YDSLAYRASK YNRMPYLLKA FGDSNALKDA
RPAMHSLRSV EAEIEVKAWK G
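
The protein sequence corresponds to the gene sequence translated with translation table 11. A small sketch of that translation follows. It is illustrative only: the function name `translate` is hypothetical, and the codon table is written out by hand in NCBI's T, C, A, G base order. Table 11 assigns the same amino acids to codons as the standard code and differs in its permitted start codons, which this sketch does not model.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TTT, TTC, TTA, TTG, TCT, ... order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence_text: str) -> str:
    """Translate a whitespace-grouped DNA sequence, stopping at the first stop codon."""
    dna = "".join(sequence_text.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# The first 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGAAGCCAG GAGCAAAGAA GTTACTAGAA"))  # MKPGAKKLLE
```

The same function applied to the end of the gene, `GGATAA`, yields `G` and then stops at the `TAA` stop codon, matching the final residue of the protein sequence.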