Gene GWCH70_2992 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_2992 
Symbol 
ID7977362 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp3012569 
End bp3014548 
Gene Length1980 bp 
Protein Length659 aa 
Translation table11 
GC content47% 
IMG OID644799792 
Productexcinuclease ABC subunit B 
Protein accessionYP_002950931 
Protein GI239828307 
COG category[L] Replication, recombination and repair 
COG ID[COG0556] Helicase subunit of the DNA excision repair complex 
TIGRFAM ID[TIGR00631] excinuclease ABC, B subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000124238 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGGGGACC GTTTTGAGTT AGTGTCGGCG TATAAGCCGC AAGGAGATCA ACCAAAAGCG 
ATTGCCAAAT TGGTCGAAGG AATTCGAAAA GGAGTAAAGC ATCAAACGCT GTTAGGGGCA
ACAGGAACGG GAAAGACGTT TACGATTTCC AACGTCATTA AAGAGGTGAA CAAACCGACG
CTTGTGATTG CCCATAATAA AACGTTAGCG GGGCAGTTGT ACAGCGAGTT AAAAGAGTTT
TTTCCAAACA ACGCGGTTGA ATATTTTGTC AGCTATTACG ATTATTATCA GCCGGAGGCG
TATGTGCCGC AGACCGATAC GTATATTGAA AAAGATGCAA GCATTAACGA TGAAATTGAT
AAGTTACGGC ACTCGGCAAC ATCAGCGCTA TTTGAGCGGC GCGATGTGAT TATTGTCGCC
AGTGTATCGT GCATTTACGG ATTAGGATCA CCGGAAGAAT ATCGCGAACT GGTTGTGTCG
TTGCGCGTCG GGATGGAAAT CGAGCGCAAC GCTTTATTGC GCCGCCTTGT CGATATCCAA
TATGAACGGA ACGACATTGA TTTTCAGCGC GGAACGTTCC GTGTTCGCGG AGATGTCGTC
GAGATTTTTC CTGCTTCTCG GGACGAGCAT TGTATTCGCG TTGAATTTTT TGGCGATGAA
ATTGACCGCA TTCGCGAAGT CGACGCATTG ACGGGGGAAA TTATCGCAGA ACGCGAGCAT
GTCGCGATTT TCCCGGCATC CCACTTCGTT ACGCGTGAAG AAAAAATGCG TTTAGCGATC
GAAAATATTG AAAAAGAATT AGAAGAGCGG CTGCGCGAAT TGCGGGAACA AGGAAAACTG
TTAGAAGCGC AGCGGCTTGA GCAACGGACT CGTTACGATT TAGAGATGAT GAGAGAAATG
GGCTTTTGCT CAGGGATTGA AAACTACTCC CGGCATTTAG CGTTGCGTCC GCCAGGCTCG
ACGCCGTACA CGCTGCTTGA TTATTTTCCA GATGATTTTT TGATTATCAT CGATGAGTCA
CACGTGACAT TGCCGCAAAT TCGCGGCATG TATAACGGAG ACCGGGCGCG CAAGCAAGTG
CTTGTCGATC ATGGCTTCCG TCTGCCATCT GCCCTCGATA ACCGCCCGTT AACGTTTGAG
GAGTTTGAAC AAAAAATTAA CCAAATTATT TATGTTTCCG CGACACCTGG TCCGTACGAG
CTGGAACATA GCCCGGAAGT TGTGGAACAA ATTATTCGTC CGACAGGGCT GTTGGATCCA
ACGATTGACG TTCGTCCGAT TGAAGGGCAA ATCGATGATT TAATCGGAGA AATTCATGAG
CGGATCAAGC GGAATGAACG CACTCTCGTT ACGACGTTAA CGAAGAAAAT GGCGGAAGAT
TTAACGGATT ACTTAAAAGA AGTCGGCATT AAAGTTGCGT ATTTACATTC CGAAATTAAA
ACGCTCGAGC GCATTGAAAT TATTCGCGAT TTGCGCATGG GCAAATACGA TGTGCTCGTC
GGGATTAACT TGTTGCGGGA AGGATTGGAT ATTCCGGAAG TGTCGCTTGT CGCCATTCTC
GATGCGGATA AAGAAGGCTT TTTGCGCTCG GAACGTTCGC TTATTCAAAC GATTGGGCGT
GCGGCGCGAA ACGCCAACGG CCATGTCATT ATGTACGCCG ATACGATTAC AAAATCGATG
GAAATTGCCA TCAACGAAAC GAAACGGCGC CGTGCGATTC AAGAAGCGTA TAACAAAAAG
CACGGCATCG TTCCGCAGAC GGTGAAGAAA GAAATTCGCG ATGTCATCCG TGCGACTTAC
GCGGCGGAAG AGAAAGAAAC GTACGATACG AAACCATCTT ACGGCAAGAT GGCAAAGAAA
GAACGAGAAA AGCTCATTGC CGATTTAGAA AAAGAAATGA AAGAAGCAGC AAAAGCGCTT
GATTTCGAGC GCGCTGCCCA ATTGCGTGAT ATTATTTTTG AGTTAAAAGC GGAAGGATGA
 
Protein sequence
MGDRFELVSA YKPQGDQPKA IAKLVEGIRK GVKHQTLLGA TGTGKTFTIS NVIKEVNKPT 
LVIAHNKTLA GQLYSELKEF FPNNAVEYFV SYYDYYQPEA YVPQTDTYIE KDASINDEID
KLRHSATSAL FERRDVIIVA SVSCIYGLGS PEEYRELVVS LRVGMEIERN ALLRRLVDIQ
YERNDIDFQR GTFRVRGDVV EIFPASRDEH CIRVEFFGDE IDRIREVDAL TGEIIAEREH
VAIFPASHFV TREEKMRLAI ENIEKELEER LRELREQGKL LEAQRLEQRT RYDLEMMREM
GFCSGIENYS RHLALRPPGS TPYTLLDYFP DDFLIIIDES HVTLPQIRGM YNGDRARKQV
LVDHGFRLPS ALDNRPLTFE EFEQKINQII YVSATPGPYE LEHSPEVVEQ IIRPTGLLDP
TIDVRPIEGQ IDDLIGEIHE RIKRNERTLV TTLTKKMAED LTDYLKEVGI KVAYLHSEIK
TLERIEIIRD LRMGKYDVLV GINLLREGLD IPEVSLVAIL DADKEGFLRS ERSLIQTIGR
AARNANGHVI MYADTITKSM EIAINETKRR RAIQEAYNKK HGIVPQTVKK EIRDVIRATY
AAEEKETYDT KPSYGKMAKK EREKLIADLE KEMKEAAKAL DFERAAQLRD IIFELKAEG