Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_2033 |
Symbol | |
ID | 7978984 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 2093924 |
End bp | 2097229 |
Gene Length | 3306 bp |
Protein Length | 1101 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 644798855 |
Product | helicase domain protein |
Protein accession | YP_002950025 |
Protein GI | 239827401 |
COG category | [R] General function prediction only |
COG ID | [COG1205] Distinct helicase family with a unique C-terminal domain including a metal-binding cysteine cluster |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCCAG GAGCAAAGAA GTTACTAGAA TACTTACAGG AAAATAAAGA ATTAGATTTT AAAGATGCTA AAAATATAGG AAGTTTTTGG ACTGATAAAA CCATTAAGCG CTTTGTGGAA GAGTACAATG ATATTTTTGT TGTTGAAAAC CAAAAAGTAA GACTTAAGAA GGGGGAAAAC TTACAGTCTT ATAAGACTCA TATAGATGTG AATGAATATG TTCAAATGCG TTCAACGATT CTTTCAGAAT TAGAGAAATT TCTCGTTGGT CCATTTGAAG AAAATGAAAC TCTCGGTCGT AGAAAAGCAC CAATGGCCCT TTATCTAACT GGGAAGCTTG TTCCTTTCGG TTCCACTTTT GATGTTATTA ATGAAGAAGA AAATCATATT GAAACAAAAC AATTATTAGA AGATGAAGCG ATGGATGAAA TGCTTATTCA TCGCCATGTA TTTCGTCCAT CTGCGATGGG ATTTAGTTTT AAAATGAAAT CCCTTTCGAG TATAAAAGTT CACATTAGTT GGGGAATGTA CGATGATGAA GAACATAAAC GGACGCAACT ACAAGAAGAA TGGTGTTTTG TTCCTGAAAA TAAAACATAT GTGGCTAAAA ATGAGCCTGC TCGAGTACGT TGTAAAATTA AATATCATGA CGGTTTATAT CATATTAGTT TATTTTTGAT TAACAGCTAT AAACGGGATT CATATCCAAA ACAAAGTGAA ATTATGTTTC AAACAAAAAT GATTGTGGAG GTTCCTAAGG AACATATAGC TGTTTTTTCT TCCAAAGCTG ATATAAATCA TTACGAAGAT GAGCTATTAT ACGGACGGCA TTTTCATGAG TATGCAGTTG GTCATGGGGT TGGCGTAGAT TGGAAAGAAA CGGATCAGTA CGTCATTATT GAGAGCAAAT GGCTTCCTTT TTATGAATTG CCTGTTGTAG AGCATCGTAC TTTCTCTCAT GCTCGATTTT TTATGAAAGA GTTAAGTGAA ATGGATTCGG AAAGTTTACA TACAACATTA TCTATTATTC CTGAGCAATA TGAAAAATGG CTATGTGAAC AAAAAGGTCA TATTCCATCT TTGCCGGAAC ATTTACAAGA TATAGCGAGA AATAATGTTA ACAAGATTGA AAAAATAATT AAAAGAATTC GAGAAGGTAT TTGTTTAATT ATAAGCAATC CTTCTGTAAA AGAAGCATTC CAATTTGCAA ATAAAGTTAT GATGATTCAG CAAGCTCAAT CGAAGGTTGC TTTGCATTAT CGTACTTTTC AAGAGAGAAT AGAACCAAAA TATACATCGG AATGGCGTTT ATTCCAAATA GCATTTTTAT TAATGAATAT CGCTGGTATT GTTGATAGAC ATCACGAAGA TAGAGATGTT GTAGATTTAA TCTGGTTTCC GACAGGTGGG GGGAAAACAG AGGCGTATCT AGGATTAGCT GCATTTACCA TGGGATATCG CCGGCTTATT GGGGAGTGGG ATAATCCGGA AACATATGCA GGTGTTACTG TATTTATGCG TTATACATTG CGGTTGCTAA CAACACAGCA ATTTCAAAGA GCCGCAGCTA TGATTTGCGC TGCTGAGTTG ATTCGTCAGG AAAATCCTGA AAAATATGGA ATTGAACCAT TTCGTATAGG ACTTTGGATA GGACAATCTT CTTCACCTAA TACATATGAA GATGCAATCC TAAAAATGGA GCAAATTAGG GAAGGAAACG AAGTACTAGA AGGAAATCCG ATGCAGTTAA CTCACTGTCC TTGGTGCGGA ACAGAACTTA ATGCGGAGGA TTACATCATT GAGCGGCATA AGCAATTAAT TCGTTGTCAT TATTATGATT GTCCTTTTTC ATCTGAAAAG GGAATTCCTG CTTTAACGAT TGATGAGGCT ATTTATCAAT ATGTTCCTAC GATTCTTATA GGAACTGTTG ATAAAATGGC GCAGATTGCT TGGAAAAAAG ATATGTATGA GTTATTTGGT CGTAAAACTC ATTATGATTT AGAAAAGGGG TTTATTTTTT CTGAAACGAA TAAAAAAGGA TATAAGAAAA TTAATTATTT AAAGCCTCCT GAATTGATTA TTCAAGATGA ACTGCATTTA ATCTCTGGTC CATTAGGATC TTTGACGGGT CTATACGAAT TAGCTGTAGA TTATTTATGT CAATATGATG GAGCAGGTCC TAAAATTGTC GCATCTACGG CCACTATTAG AGGAGCTGAT GAACAGATTC GTCGTCTCTA CGGCCGTGAG GCTAGTCAAT TTCCTTTACC AGTCCTAAAA GCAACAGATA ATTTTGTATC GTATGAAGTT CCAACACAAC AAAAGCCAGG AAGGTTATAT GTAGGAATTT GTGCACCGGG TGTCAGTGGA AAAATTCATT CTGTCCATGT TTATTCAGCA TTGTTGACGA TCAGTGAAAA ATTAAAAGGA CCTGTAATTG ATCCGTATTG GACAATTTTA GGGTATTTTA ATACGATAAA AGAATTAGCA GGAACAACAA TGCTTTTTAA AGATGAAATC CCAGTTCGTT TAAAATTACT TAATGAGGAT TCCGAGCAAA AGGAATTAAT TATTGAGGAA ATGACGAGTC GAAGAAAAGC AAGGGAAATT CCTCATTTGC TGGCTCAAAT GGAAAAAACG TATGCAGAAA ACGGAGCTCT TAACGCTGTA TTAGCTACAA ATATGATTTC GGTAGGAGTG GATGTCAATC GTCTTGGAAT TATGGTTGTG CATGGTCAGC CAAAAACGAC ATCAGAATAT ATTCAAGCAA CTAGCCGTGT TGGAAGAACA TATCCAGGTC TTGTTTTAAC CTTATTCAAC TCTTTACGTC CACGTGATTT ATCACACTAT GAAAGATTTA AATCCTACCA TAGTTCAATT TACCGTTTTG TTGAACCTAC AAGTGTGACA CCATTCGCTC GAGGTAGTAT TCAACGTGGA TTAACCGGCC TGGTAGTAGG ATCAATGCGG CAAGGAATCA TAGAGATTAG CAAAGAACAA AGTGCAAAAC GTTTTGTGAT AAACGAAGAC GTCGAAAAGA TTAAGAAATT TTTAATTGAG AGAGCCGTAA AAACAGGAGA AATATCTGAG CAAGAACTTG AACAACATAT TGAGAGCGTT TTAGATTGGT GGCTTGGAAT GACAAATAAA TATGATTCCC TTGCCTATCG AGCTTCAAAA TATAATCGCA TGCCATATTT ATTGAAAGCT TTTGGTGATA GCAATGCATT GAAAGATGCA AGACCTGCGA TGCATTCTCT CAGAAGTGTG GAAGCAGAGA TTGAAGTAAA AGCGTGGAAA GGATAA
|
Protein sequence | MKPGAKKLLE YLQENKELDF KDAKNIGSFW TDKTIKRFVE EYNDIFVVEN QKVRLKKGEN LQSYKTHIDV NEYVQMRSTI LSELEKFLVG PFEENETLGR RKAPMALYLT GKLVPFGSTF DVINEEENHI ETKQLLEDEA MDEMLIHRHV FRPSAMGFSF KMKSLSSIKV HISWGMYDDE EHKRTQLQEE WCFVPENKTY VAKNEPARVR CKIKYHDGLY HISLFLINSY KRDSYPKQSE IMFQTKMIVE VPKEHIAVFS SKADINHYED ELLYGRHFHE YAVGHGVGVD WKETDQYVII ESKWLPFYEL PVVEHRTFSH ARFFMKELSE MDSESLHTTL SIIPEQYEKW LCEQKGHIPS LPEHLQDIAR NNVNKIEKII KRIREGICLI ISNPSVKEAF QFANKVMMIQ QAQSKVALHY RTFQERIEPK YTSEWRLFQI AFLLMNIAGI VDRHHEDRDV VDLIWFPTGG GKTEAYLGLA AFTMGYRRLI GEWDNPETYA GVTVFMRYTL RLLTTQQFQR AAAMICAAEL IRQENPEKYG IEPFRIGLWI GQSSSPNTYE DAILKMEQIR EGNEVLEGNP MQLTHCPWCG TELNAEDYII ERHKQLIRCH YYDCPFSSEK GIPALTIDEA IYQYVPTILI GTVDKMAQIA WKKDMYELFG RKTHYDLEKG FIFSETNKKG YKKINYLKPP ELIIQDELHL ISGPLGSLTG LYELAVDYLC QYDGAGPKIV ASTATIRGAD EQIRRLYGRE ASQFPLPVLK ATDNFVSYEV PTQQKPGRLY VGICAPGVSG KIHSVHVYSA LLTISEKLKG PVIDPYWTIL GYFNTIKELA GTTMLFKDEI PVRLKLLNED SEQKELIIEE MTSRRKAREI PHLLAQMEKT YAENGALNAV LATNMISVGV DVNRLGIMVV HGQPKTTSEY IQATSRVGRT YPGLVLTLFN SLRPRDLSHY ERFKSYHSSI YRFVEPTSVT PFARGSIQRG LTGLVVGSMR QGIIEISKEQ SAKRFVINED VEKIKKFLIE RAVKTGEISE QELEQHIESV LDWWLGMTNK YDSLAYRASK YNRMPYLLKA FGDSNALKDA RPAMHSLRSV EAEIEVKAWK G
|
| |