Gene GWCH70_2115 details

Gene Information

Locus tag: GWCH70_2115
Symbol:
ID: 7976926
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacillus sp. WCH70
Kingdom: Bacteria
Replicon accession: NC_012793
Strand:
Start bp: 2183156
End bp: 2185921
Gene Length: 2766 bp
Protein Length: 921 aa
Translation table: 11
GC content: 42%
IMG OID: 644798931
Product: bifunctional ATP-dependent DNA helicase/DNA polymerase III subunit epsilon
Protein accession: YP_002950091
Protein GI: 239827467
COG category: [K] Transcription
    [L] Replication, recombination and repair
COG ID: [COG1199] Rad3-related DNA helicases
TIGRFAM ID: [TIGR00573] exonuclease, DNA polymerase III, epsilon subunit family
    [TIGR01407] DnaQ family exonuclease/DinG family helicase, putative
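
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the CDS ends in a stop codon; the function names are hypothetical and not part of any database API.

```python
# Coordinates copied from the record above (assumed 1-based, inclusive).
start_bp = 2183156
end_bp = 2185921


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # subtract the stop codon


length_bp = gene_length(start_bp, end_bp)
print(length_bp, protein_length(length_bp))  # 2766 921
```

Both values agree with the Gene Length and Protein Length fields of the record.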


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAGACC GTTTTGTCAT TATTGATTTA GAAACAACGG GGAATGTTCC GAAAAAAGGC 
GACCGCATCA TTCAGCTTGG CATGGTGGTC ATTGAGAACG GGAAAATTGT TGATCGTTTC
TCTAGTTTTT TTAATCCAGA ACGAGACATT CCTCCATTTG TTCAGCAATT AACAAATATT
ACTGAACAAA TGGTCGCGGA TGCACCTAGT TTTGCTGAGG AAGCTGCAAA AGTGGCTGAC
ATGTTGCAAC AGTCTTATTT TGTTGCTCAT AACGTCTCTT TTGATTTGCA GTTTTTGCAG
GAAGAATTGC ATATGGCAGG ATTGCCTCCG TTTTCAGGAC CGACGATTGA TACGGTTGAA
CTTGCCCGCA TTGTCTTTCC GACTGCCGAA AGCTATAAAT TAGGAGATTT AGCAAAACAG
CTGCATATTG ACCATGACCA GCCGCATCAA GCCGATAGCG ATGCGGAAGT GACAGCTAAA
TTGTTCATTG CGTTACTGAA CCGTTTGCGT CAACTGCCGT TTATAACATT GCAGCATCTA
AAGAGATTGT CTCCTTATCT TAAAAGTGAT TTATATCCAT TGTTAGACAA TATCATTATG
GAAAAGATGG CGTCTTTAGC GGATGAAAAG TCGTATATAT TTTACCGCGG CATTGCGCTG
AAAAAACCTG TTTCGATGCA AAGAAAAGAG GAAGATCATC ATAAAGACGC CTCATTTGCC
GCGTTTTTTG CTGGCCCGCA TACGCTTCCG CTTGATCATT ATGAAAGACG AGATGGGCAA
TGGGAAATGA TGAAGCTCGT TTATGAGGCG CTCGAAACGT CTCAGCACGC TTTAATCGAA
GCGGGGACAG GAATTGGCAA ATCGCTGGCG TATTTAATTC CAAGCGCTTT TTTTGCATAT
GAACAGCAAA AACGTGTTGT GATTAGTACA CATACGCTTC AGTTGCAACA GCAACTTCTT
GAACGGGATA TTCCTTTTTT AAAAGAGGTC GTTCCGTTTC CGCTTCGCGT AGCGGTGCTG
AAAGGAAAGC GCAATTATCT TTCATTGGAC AAGTTCATCG CTTTTTTGCA GGAACATCAT
CAAAATTACG ATGTTGTGTT ATTAAAATGT CAGCTTCTTG TTTGGTTGAC GCAAACGGAA
ACGGGAGATA TGGATGAATT GAATGCATCG TCCGGCGCGC GTTTATTTTG GCCGCTGTTA
GTTATGGATG AAGAAGATAA TGGCGGAAAA TATAATTTCT TTTGGAGAGC GAAGCAGCGA
GCGGAAGAAG CTCATATTGT GATTACGAAT CACGCTTTTT TATTGCATGA TGTTACTGCT
TCTACTCCGC TTCTGCCTGA TTATGAGCAT ATGATTATGG ATGAAGCGCA TCATTTAGAA
GAGGTAGCGT CCCATTATTT TGGACAGCAA GTGGATTATG TTTCGATTCG TTTAATACTA
ACAAGAATTG GAAAAGTGAA TGAAAATGGT TCGCTCGCAA AATTAATGAA ATTATTTGCC
GGGCGCCATT GGAATGCGGA TGATCATTTT TTTCGCTGCG GGCAGCTTGT AGAAGAACTT
CAGTTTGAGT GTGATGAACT GTTTCGCATG ATACGCCGTT ATGCATTAGA AAGGAAATCA
GCGGAATCTG GACGCTGCCG CTACCGATTT GAACCACAAA AAGAAAGCGG CCGCCAATGG
AGTGCAATGA AAGAATTGCT TTGGCGCATC CGCGGCCATG TTGCCAACCT TGTCGATGAA
ACAAAACAGT TGCAAACGTT TTTTTCAGAA AACAACATGG AATCATCCAT ATCTGCCCAG
CCCTATTCCT ATTTTTCCGA CGTTTCATCC TTGAACCAAA TCATTGATAC ATTATACGAT
TTAATCGAAA GCGATGACCC AATGGTGGTC CGGTGGATTG AAGCGGAAGA AAAAGGAGCA
GCTAATGCAA CGGCTGTTTA CTCACAGCCC ATTCAGTTGG ATGAATTTTT TGCCGAACGG
CTGTTTATGC CGAAAAAAAG TATTGTGCTT ACCTCGGCGA CATTAACGAT GAATGGCAGC
TTTGCATATA TAATATCCCG TTTTGGATTA AGCGATTTTT ATCCGATTTG CCAAACGATT
CCGTCTCCGT TTTCGTATAA AGAACAGGCA ATGTTAATGA TTCCAAGCGA TTTTCCGCCG
ATTTCTTCTG TATCTTTGGA AGAATATGCT GCCGTAGTTG CTGAAGGGGT GGCACAAATT
GCTAAGCGAA TCAAAGGAAA AATGCTTGTT CTCTTTACGT CATACGAATT GCTTAAATTG
ACGGCGAACG CGATGAAGAT GGAGGAACGG AATGAAGATT ATGTATTGAT TGCGCAAGGG
GTACAAAGCG GGAGTGCGAC AAAACTGACA AAAGCGTTTC AACAGTTTGA TCAAGCTATT
TTGTTTGGAA CAAGCAATTT TTGGGAAGGT GTCGATTTGC CTGGTGATGA GTTGTCGGTA
GTAGTGATTG TCCGTTTGCC ATTTGCACCG CCAGATGATC CAGTGATCGA AGCAAAAAGC
GAGTATATTC GCGCCAAAGG CGGGAATCCT TTTTACGAGT TGTCTCTTCC GGAGGCTGTA
CTTCGCTTTA AACAAGGTTT TGGAAGACTA ATTCGAACGA AAAAAGATAA GGGTGCGATA
TTTGTTTTTG ATCGCCGGCT TACATCTGCT TCATATGGAA AATACTTTTT AAACTCGTTG
CCGTCGCTTA CGATCTGTGA AGAATCGCTT GATCAACTTT TGCAAAAACT TGAAAAATGG
CTATGA
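
The gene sequence above is displayed in blocks of 10 bases, so it has to be stripped of whitespace before any computation. The sketch below shows one way to do that and to compute a GC percentage comparable to the GC content field. The helper names are hypothetical, and only the first displayed line is used as sample input, so the printed percentage is for that line alone, not for the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, exactly as displayed in the record.
first_line = "ATGAAAGACC GTTTTGTCAT TATTGATTTA GAAACAACGG GGAATGTTCC GAAAAAAGGC"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Pasting the full gene sequence into `clean_sequence` gives a string whose length can be compared with the Gene Length field.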
 
Protein sequence
MKDRFVIIDL ETTGNVPKKG DRIIQLGMVV IENGKIVDRF SSFFNPERDI PPFVQQLTNI 
TEQMVADAPS FAEEAAKVAD MLQQSYFVAH NVSFDLQFLQ EELHMAGLPP FSGPTIDTVE
LARIVFPTAE SYKLGDLAKQ LHIDHDQPHQ ADSDAEVTAK LFIALLNRLR QLPFITLQHL
KRLSPYLKSD LYPLLDNIIM EKMASLADEK SYIFYRGIAL KKPVSMQRKE EDHHKDASFA
AFFAGPHTLP LDHYERRDGQ WEMMKLVYEA LETSQHALIE AGTGIGKSLA YLIPSAFFAY
EQQKRVVIST HTLQLQQQLL ERDIPFLKEV VPFPLRVAVL KGKRNYLSLD KFIAFLQEHH
QNYDVVLLKC QLLVWLTQTE TGDMDELNAS SGARLFWPLL VMDEEDNGGK YNFFWRAKQR
AEEAHIVITN HAFLLHDVTA STPLLPDYEH MIMDEAHHLE EVASHYFGQQ VDYVSIRLIL
TRIGKVNENG SLAKLMKLFA GRHWNADDHF FRCGQLVEEL QFECDELFRM IRRYALERKS
AESGRCRYRF EPQKESGRQW SAMKELLWRI RGHVANLVDE TKQLQTFFSE NNMESSISAQ
PYSYFSDVSS LNQIIDTLYD LIESDDPMVV RWIEAEEKGA ANATAVYSQP IQLDEFFAER
LFMPKKSIVL TSATLTMNGS FAYIISRFGL SDFYPICQTI PSPFSYKEQA MLMIPSDFPP
ISSVSLEEYA AVVAEGVAQI AKRIKGKMLV LFTSYELLKL TANAMKMEER NEDYVLIAQG
VQSGSATKLT KAFQQFDQAI LFGTSNFWEG VDLPGDELSV VVIVRLPFAP PDDPVIEAKS
EYIRAKGGNP FYELSLPEAV LRFKQGFGRL IRTKKDKGAI FVFDRRLTSA SYGKYFLNSL
PSLTICEESL DQLLQKLEKW L
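
The protein sequence can be reproduced from the gene sequence with the genetic code named in the Translation table field. The sketch below is a minimal, dependency-free translator. It assumes that table 11 assigns the same amino acid to each codon as the standard code (the two differ in permitted start codons, which does not matter here because this gene starts with ATG). The names `CODON_TABLE` and `translate` are hypothetical.

```python
# Codon table in the conventional TCAG ordering (first, second, third base).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop the 10-base block spacing
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGAAAGACC GTTTTGTCAT TATTGATTTA"))  # MKDRFVIIDL
```

The output matches the first block of the protein sequence. A sequence containing ambiguity codes such as N would raise a `KeyError` in this sketch.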