Gene GWCH70_2633 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_2633 
Symbol 
ID7978296 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp2667969 
End bp2669687 
Gene Length1719 bp 
Protein Length572 aa 
Translation table11 
GC content46% 
IMG OID644799434 
Producthypothetical protein 
Protein accessionYP_002950593 
Protein GI239827969 
COG category[L] Replication, recombination and repair 
COG ID[COG1796] DNA polymerase IV (family X) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000221598 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGGAA ATAAAAAAGA TGTCATTCGT CTGCTGGAGA CGATTGCGTT ATATATGGAA 
ATCAAAGGAG AAAATCCGTT TAAAATTGCG GCGTTTCGCA AAGCGGCAAG CGCATTGGAA
ACTGATGAAC GAAGCATTGC GGAAATAGAC GATTTCACCG CGATTCCCGG CATCGGAAAG
GGGACGGCAA GCGTTATTCA TGAATTTTTG GAAACAGGAA CGTCCAGCGT TCTTGAACAA
TTGAAACAGA AAATTCCAGA GAGCTTGCTC ACCCTGCTTC GGCTTCCCGG ACTTGGCGGC
AAGAAAATCG CAAAGCTATA TCAAGAACTG GGCATTGTGG ATATTGCTTC ATTGAAAGAA
GCTTGCCTCG AGCAGAAAGT ACAGCAGCTT CCAGGCTTCG GAAAGAAAAC GGAAGAAAAG
CTTTTAGCCG CCATTGAAGA AATCGGCTCT CGCCCGGAAC GGCTTCCGTT AGCCTTTGTG
CTGCCGATTG CCGAGGAAAT AGAAAATCAG CTTAAAAATA TGGAAGGAAT CGTTCGTTTT
TCTCGTGCCG GCAGTTTGCG GCGAATGAAG GAAACAGTGA AAGATTTAGA TTTCATTATC
GCAACGAACG ATCCGCATCT TGTACGGGAA CATTTATTGA AGCTAGCGAA TGTCTCAGAT
GTGATTGCAA ACGGTGATAC GAAAGTGTCC CTAGAGCTTC GCTATGAGTA TGAAATTGCT
GTTGATTTTC GTTTAGTGAC ACCGGAGCAG TTCGCCACGA CGCTCCATCA TTTTACGGGA
TCGAAAGAAC ATAATGTCCG CATGCGGCAG CTGGCAAAAG AGCGCGGTGA AAAAATTAGT
GAGTATGGAG TAGAAAATGT GAAAACGGGA GAAGTGAAAA CATTTTCTGA TGAACAAGCG
TTTTTTGCCC ATTTCCAGTT GCCGTTTATT CCGCCGGAAC TAAGGGAAGA TGGGACCGAA
GTCGACCGCT ATCGCGACGA CTATTCGCTT CTTTGCCTTT CCCACATTCA AGGAGATTTG
CATATGCACT CTACTTGGAG CGACGGAGCG TGCTCGATCG AGGAAATGGC GGAAGCATGC
CGGAAAAAAG GCTACCGCTA TATGGCGATT ACCGATCATT CTCAATATTT AAAGGTCGCC
AACGGGCTGA CGGTCGAACG GTTAAAGCGG CAGCGCGAAG AAATTGAACG GTTAAACGCG
AAATATGATG ATTTTACAAT TTTGGCTGGA ATAGAAATGG ATATTTTGCC AGATGGGACG
CTTGATTACG ATGATGGCGT TCTCGAAGAA CTTGACTTTG TCATCGCTGC GATTCATTCA
AGTTTTTCCC AGTCGCGTGA CGTGATTATG AAGCGTCTTG CTGCTGCGCT TCGCAATCGT
CATGTTGATT TGATCGCTCA TCCGACAGGG CGGTTAATCG GAAAGCGAGA CGGATATGAC
GTAGATATAG ACATGCTTAT CGAATTGGCG CGGGAAACGA ATACGGCGCT TGAGTTAAAT
GCGAATCCGA ACCGTCTCGA TTTGTCGTAT TCCTATTTGA AAAAAGCGCA AGATGCTGGA
GTAAAAATCG CGATTAATAC AGATGCCCAC CATTTGGACA TGCTTGACCA TATGGAAATA
GGGGTCATAA CCGCACGAAA AGGATGGATT CGCAAGGAAA CGGTGATCAA TACATGGTCT
CTTGAAGAGT TGCGCAGCTT TTTGCAACGC AATCGGTAA
 
Protein sequence
MKGNKKDVIR LLETIALYME IKGENPFKIA AFRKAASALE TDERSIAEID DFTAIPGIGK 
GTASVIHEFL ETGTSSVLEQ LKQKIPESLL TLLRLPGLGG KKIAKLYQEL GIVDIASLKE
ACLEQKVQQL PGFGKKTEEK LLAAIEEIGS RPERLPLAFV LPIAEEIENQ LKNMEGIVRF
SRAGSLRRMK ETVKDLDFII ATNDPHLVRE HLLKLANVSD VIANGDTKVS LELRYEYEIA
VDFRLVTPEQ FATTLHHFTG SKEHNVRMRQ LAKERGEKIS EYGVENVKTG EVKTFSDEQA
FFAHFQLPFI PPELREDGTE VDRYRDDYSL LCLSHIQGDL HMHSTWSDGA CSIEEMAEAC
RKKGYRYMAI TDHSQYLKVA NGLTVERLKR QREEIERLNA KYDDFTILAG IEMDILPDGT
LDYDDGVLEE LDFVIAAIHS SFSQSRDVIM KRLAAALRNR HVDLIAHPTG RLIGKRDGYD
VDIDMLIELA RETNTALELN ANPNRLDLSY SYLKKAQDAG VKIAINTDAH HLDMLDHMEI
GVITARKGWI RKETVINTWS LEELRSFLQR NR