Gene GWCH70_0353 details

Gene Information

Locus tag: GWCH70_0353
Symbol:
ID: 7977466
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacillus sp. WCH70
Kingdom: Bacteria
Replicon accession: NC_012793
Strand:
Start bp: 402320
End bp: 404263
Gene Length: 1944 bp
Protein Length: 647 aa
Translation table: 11
GC content: 49%
IMG OID: 644797344
Product: protein of unknown function DUF181
Protein accession: YP_002948544
Protein GI: 239825920
COG category: [S] Function unknown
COG ID: [COG1944] Uncharacterized conserved protein
TIGRFAM ID: [TIGR00702] uncharacterized domain
            [TIGR03604] bacteriocin biosynthesis docking scaffold, SagD family
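
The start and end coordinates, the gene length, and the protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes inclusive, 1-based coordinates and that the 1944 bp include one terminal stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive, 1-based coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(402320, 404263)
print(length, protein_length(length))  # prints: 1944 647
```

Both values match the Gene Length and Protein Length fields of this record.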


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.742024
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGTGCTC GCGTATTGAT GGTTGGAGAA GGAGTGTTGG CGGATTTTGT GTATGAAAAA 
TTGTCCGTCC AATATCAGGT AGTTCGGCAA ATCGATTTCG AGGAAGGAAT TCCGGAAGAA
ACAGGTTTGG CTCTGGTGTT GCATGATGCT TGGCATCCCT CCGTTCACCA CAAGGCGGAA
GAGGCATTAC GATCGTCAGG CATTCCATGG CTCCGGGGCT TTGTTTCATT TGGGGAGGGC
GTGATCGGTC CGCTAGTTCG CCCTAATACC CCGGGATGTT CTCAGTGTGC TGACATGCGG
CGCCTTATAG CGGGATACGA CCGCAAGGAA ATGTGGGAGC TGCAACAGGT GATGGCGGTG
CAAGGGGGAA TACAGCATGA TGCATGGGCA TCACGAATAG GACTTTTGCA GATGGCTCAC
CTGATTGTCG CGGAGACGCA GAGGGTGTTG GAAGGCAGTC ATACCCGTTT AGAAGAAAGG
TTGTTCCTAA TCAACCTGAA AACATTAAAG AACTCATGTC ACTTCTTTCT GCCCGACCCG
TTATGTACGG TATGCAGCCA ATTACCTGAC GATTCGCCGG CAGCAGCCCG CATATCGCTG
CAACCAAGTC CGAAGATCAG CGCTGACAGT TACCGCTGCC GTCCGATCGA GGAGCTGAAA
GAAGTTCTGA TCAAAGACTA TCTAGATTAT CGAACTGGAT TATTGAATGG TAAAATGCAT
AATTTCGCGT TGCCGTTTGC GGATGTTGTT GTAAATATGC CGATGTTTAT AGGGGACGAG
GGAGTAGCAG GCCGGACTCA TTCCTATGAG GTTAGCGAGT TAACCGCCAT TTTGGAGGGG
TTAGAGAGAT ATTGTGGCAT CGAAGCTCGT GGCAAACGGA CAGTGATTCA TGACAGCTAC
CGAAATTTGA AAGATCAAGC ACTCAACCCA GTAAAGGTAG GAGTGCATGC GAAGGAACAG
TATGCGCGAC CTGATTTTCC GTTCAAACCG TTTCATCCGG ATCGTCCAAT GAATTGGGTA
TGGGGCTATT CGCTTTTACA AGAGCGTCCG ATTTTGGTTC CAGAGTTGCT CGCATATTAC
AGTTTGGGAG GTGGGGATGG CTTTGTCTAT GAAACTTCCA ACGGATGTGC ATTAGGCGGG
AGTTTAGAAG AAGCGATTTT CCATGCCATT TTGGAGGTGG TGGAGCGCGA TTCATTCTTG
ATGGCTTGGT ATGCGCAGCT GCCTCTTCCG CGTCTTGACC TTCGTTCGGC TAACGATAAA
GAATTACAGT TGATGGTCGA TCGTGTACGT GCGGTGGCGG GATATGATCT GTATTTTTTC
AACTCGACGA TGGAGCACGG AATTCCAAGC GTCTGGGCAG TGGCGAAAAA CAGAAAACAA
AAGGGATTGA ATCTCATCTG TGCCGCTGGA TCTCATCCGG ACCCTATACG GGCGGTGAAA
AGCTCGATTC ACGAGTTAGC AGGCATGATG CTTGTGCTTG ACGAGAAATT TGAGGCAAAC
CGAAAGAAAT ATGAAAAAAT GTTGCATGAT CCGCTATTAG TGCGGCAGAT GGAAGACCAT
GGCATGCTGT ACGGTTTGCC GGAAGCAGAG GAGCGCCTGC AATTTTTGTT GGATGATCAT
CGTCCGTTGC GAACGTTTGA AGAGGAATTC AAGCAGCAAA CGAAGAATGC AGACTTGACG
GATGACCTGC GGGTTATTCT TCAGAAGTTC CGACGATTGA ATCTTGAGGT AATTGTCGTG
GACCAGACAA CACCTGTCAT CAAACGGAAT GGATTATATT GTGTGAAAGT ACTGATTCCG
GGAATGTTAC CGATGACATT TGGGCATCAT CTTACCCGCG TGACAGGCCT GGAGAGGGTG
CTCCGGGTAC CAATGGAACT CGGGTATACG ACAAAACCGC TCACGCTTGA ACAGCTTAAT
CCACATCCCC ATCCGTTCCC ATAG
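
The GC content reported in the Gene Information section can be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as shown here, in 10-base groups separated by spaces and line breaks; `gc_content` is a hypothetical helper name, and the fragment used is only the first two groups of the sequence.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Illustrative fragment: the first two 10-base groups of the gene sequence.
fragment = "ATGGGTGCTC GCGTATTGAT"
print(f"{gc_content(fragment):.1f}%")
```

Passing the full gene sequence instead of the fragment gives a value that can be compared with the reported 49%.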
 
Protein sequence
MGARVLMVGE GVLADFVYEK LSVQYQVVRQ IDFEEGIPEE TGLALVLHDA WHPSVHHKAE 
EALRSSGIPW LRGFVSFGEG VIGPLVRPNT PGCSQCADMR RLIAGYDRKE MWELQQVMAV
QGGIQHDAWA SRIGLLQMAH LIVAETQRVL EGSHTRLEER LFLINLKTLK NSCHFFLPDP
LCTVCSQLPD DSPAAARISL QPSPKISADS YRCRPIEELK EVLIKDYLDY RTGLLNGKMH
NFALPFADVV VNMPMFIGDE GVAGRTHSYE VSELTAILEG LERYCGIEAR GKRTVIHDSY
RNLKDQALNP VKVGVHAKEQ YARPDFPFKP FHPDRPMNWV WGYSLLQERP ILVPELLAYY
SLGGGDGFVY ETSNGCALGG SLEEAIFHAI LEVVERDSFL MAWYAQLPLP RLDLRSANDK
ELQLMVDRVR AVAGYDLYFF NSTMEHGIPS VWAVAKNRKQ KGLNLICAAG SHPDPIRAVK
SSIHELAGMM LVLDEKFEAN RKKYEKMLHD PLLVRQMEDH GMLYGLPEAE ERLQFLLDDH
RPLRTFEEEF KQQTKNADLT DDLRVILQKF RRLNLEVIVV DQTTPVIKRN GLYCVKVLIP
GMLPMTFGHH LTRVTGLERV LRVPMELGYT TKPLTLEQLN PHPHPFP