Gene GWCH70_3041 details


Gene Information

Locus tag: GWCH70_3041
Symbol:
ID: 7977405
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacillus sp. WCH70
Kingdom: Bacteria
Replicon accession: NC_012793
Strand:
Start bp: 3058458
End bp: 3059957
Gene Length: 1500 bp
Protein Length: 499 aa
Translation table: 11
GC content: 51%
IMG OID: 644799835
Product: helicase domain protein
Protein accession: YP_002950974
Protein GI: 239828350
COG category: [L] Replication, recombination and repair
COG ID: [COG4098] Superfamily II DNA/RNA helicase required for DNA uptake (late competence protein)
TIGRFAM ID:
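
The coordinate and length fields above are related by simple arithmetic: with 1-based inclusive coordinates, the gene length is End bp minus Start bp plus one, and the protein length is the number of codons minus the stop codon. The following minimal Python sketch checks that relationship for this record. The function names are illustrative and not part of any database API, and the inclusive-coordinate convention is an assumption that the listed values happen to satisfy.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Number of residues encoded by an unspliced CDS that ends in one stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # subtract the terminal stop codon


# Values copied from the record above.
length = gene_length_bp(3058458, 3059957)
print(length, protein_length_aa(length))
```

For this record the computed values agree with the listed Gene Length (1500 bp) and Protein Length (499 aa).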


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.0601765
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGCGTTTTA TTGTGGATGA AGGAAGGTTG ATTCCCGAAG CATTGGCCAA AACCAATGAC 
CAAACTGCTA AACCGATCAG CTACATTGAT GTCGCGTCTT CTATCCCAAT GCACCCCGAG
TTTCCTTACT CCCCAGAACT CCTTTCCTTT CTGGAAGGAA AACAGCTTCT CCTCGAGGAA
CTTCCTTTTC CTCTCGAGAT GATTCAAGCC CATTATGAAC ACGGCTACCT TTCCTATGAA
AAAGGGATTG CGAAAACGAA ACATGGATGG CGTTGCATGA GGTGCGGAAA CGAGGAGAAT
CATTTTTTCG CCTCATTTCC TTGTGCACGC TGTCAAGCGG TTTGTACATA TTGCCGCAAA
TGCATTATGA TGGGGCGAGT CAGTACATGC ACCCCGCTGG TTGTATCCCG CTTTTCTTTC
CCTCAAGCTT GCTATTTTTC CCCGCTTTCC TGGAACGGAA TATTATCCCA AGGCCAGCAG
CGTGCTGCCG ATGCGGTGGA GGACGCAATC GTGCGGAATG ACGAATTGCT CGTATGGGCG
GTTTGTGGCG CTGGAAAAAC GGAGGTATTG TTTCCAGGCA TCGCGCGGGC GCTTGAGATG
GGAAAACGCG TATGTATTGC CACCCCAAGA ACCGACGTCG TGCGCGAGCT TGCCCCCCGT
TTGAAACAAG CATTTCCAAG CGTCCCATTG ATCGCCTTGT ACGGCGGCAG CGACGACCGC
GGCAAATTCG CCCCTTTTGT TATTTCCACC ACCCATCAGC TATTACGGTT TTACCGCGCT
TTTGATGTGA TGGTGATTGA TGAAGTCGAC GCCTTCCCGT ATTCGATGGA ACCGATGCTT
GAATATGCTG TCGCAAAAGC GCGCAAAGAG ACATCCAGTC TTATTTATTT AACGGCAACT
CCACATCCAG CTTGGCAGAA AGAAATCAAG CGCGGCAAAC GAAAAGCGGT CACCATTCCC
GCCCGCTACC ACGGTTTTCC CCTTCCTGTC CCGTCCTTCG AATGGTGTGG CAACTGGCGC
AAGCAGTTAA AGCGCAGTCG TCTTCCCCGC AACGTCATCA CCTGGGTGAA ATTGCGCATT
GAAACAGCAA AACAAGCGTT TTTATTCGTC CCCCATATTG ATGAGCTCGA GCAAGTTGTA
CGTATATTGA AACAATTAGA CGAGCGGATC GAAGGCGTTC ACGCGGAAGA TCCGAAGCGC
GCGGAAAAAG TGCAAGCGTT TCGTGACGGT CGCATTCCGC TTCTTGTCAC TACGACGATT
TTGGAACGCG GCGTGACCGT TCCGAACATC GATGTTGCCG TGCTTGGCGC GGAAGACCGC
ATTTTTACGG AAAGCGCGCT CGTGCAAATT GCCGGCCGCG TCGGGCGAAG CGCTCAATTT
CCAAGCGGTG ACATCCGTTT TTTCCATTAC GGAAAAACGC GGGAAATGGT CGCGGCGAAA
CGACAGATTG AGCGAATGAA TAAGGAGGCT TCAGAAAGGG GGATGTTAAA AACGCAATGA
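
The GC content field of a record like this one can be recomputed from the gene sequence itself. The sketch below is one way to do that in Python; `gc_percent` is an illustrative helper, not a function from any particular library. It strips the spaces and line breaks used in the 10-base block layout before counting. How the database rounds its reported percentage is not stated in the record, so the result should be compared with the listed value rather than expected to match it digit for digit.

```python
def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Example on the first line of the gene sequence above (60 bases).
first_line = "TTGCGTTTTA TTGTGGATGA AGGAAGGTTG ATTCCCGAAG CATTGGCCAA AACCAATGAC"
print(round(gc_percent(first_line), 1))
```

Passing the full 1500 bp sequence instead of the first line gives the figure to compare against the GC content field.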
 
Protein sequence
MRFIVDEGRL IPEALAKTND QTAKPISYID VASSIPMHPE FPYSPELLSF LEGKQLLLEE 
LPFPLEMIQA HYEHGYLSYE KGIAKTKHGW RCMRCGNEEN HFFASFPCAR CQAVCTYCRK
CIMMGRVSTC TPLVVSRFSF PQACYFSPLS WNGILSQGQQ RAADAVEDAI VRNDELLVWA
VCGAGKTEVL FPGIARALEM GKRVCIATPR TDVVRELAPR LKQAFPSVPL IALYGGSDDR
GKFAPFVIST THQLLRFYRA FDVMVIDEVD AFPYSMEPML EYAVAKARKE TSSLIYLTAT
PHPAWQKEIK RGKRKAVTIP ARYHGFPLPV PSFEWCGNWR KQLKRSRLPR NVITWVKLRI
ETAKQAFLFV PHIDELEQVV RILKQLDERI EGVHAEDPKR AEKVQAFRDG RIPLLVTTTI
LERGVTVPNI DVAVLGAEDR IFTESALVQI AGRVGRSAQF PSGDIRFFHY GKTREMVAAK
RQIERMNKEA SERGMLKTQ
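
The protein sequence follows from the gene sequence under translation table 11 (the bacterial, archaeal and plant plastid code). The gene begins with TTG, which normally codes for leucine but is read as methionine when it serves as the initiation codon, which is why the protein starts with M. The following Python sketch illustrates this; `translate_cds` is an illustrative name, and the sketch assumes a forward-strand, unspliced CDS given 5' to 3'. The amino acid string and the set of start codons are the ones published for NCBI translation table 11.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG
# (table 11 uses the same codon assignments as the standard code).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}
# Alternative initiation codons allowed by table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading a valid first codon as M and stopping at the first stop."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is translated as methionine
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("TTGCGTTTTA TTGTGGATGA AGGAAGGTTG"))  # MRFIVDEGRL
```

The output for the first 30 bases matches the first ten residues of the protein sequence listed above; the internal TTG at the tenth codon is translated as leucine, as expected.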