Gene Information |
Locus tag | GWCH70_0353 |
Symbol | |
ID | 7977466 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 402320 |
End bp | 404263 |
Gene Length | 1944 bp |
Protein Length | 647 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 644797344 |
Product | protein of unknown function DUF181 |
Protein accession | YP_002948544 |
Protein GI | 239825920 |
COG category | [S] Function unknown |
COG ID | [COG1944] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR00702] uncharacterized domain; [TIGR03604] bacteriocin biosynthesis docking scaffold, SagD family |
|
|
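The coordinate and length fields in the Gene Information table are consistent with each other, and the arithmetic can be sketched as below. This is a minimal illustration, not part of the record itself. It assumes that Start bp and End bp are 1-based inclusive positions and that the CDS length includes the terminal stop codon; both assumptions agree with the values listed here. The function names `gene_length_bp` and `protein_length_aa` are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Protein length for an unspliced CDS whose length includes the stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no amino acid.
    return cds_length_bp // 3 - 1


# Values taken from the Gene Information table for GWCH70_0353.
length = gene_length_bp(402320, 404263)
print(length)                     # 1944
print(protein_length_aa(length))  # 647
```

The same check applies to genes on the minus strand, since only the span between the two coordinates matters for the length.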
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.742024 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTGCTC GCGTATTGAT GGTTGGAGAA GGAGTGTTGG CGGATTTTGT GTATGAAAAA TTGTCCGTCC AATATCAGGT AGTTCGGCAA ATCGATTTCG AGGAAGGAAT TCCGGAAGAA ACAGGTTTGG CTCTGGTGTT GCATGATGCT TGGCATCCCT CCGTTCACCA CAAGGCGGAA GAGGCATTAC GATCGTCAGG CATTCCATGG CTCCGGGGCT TTGTTTCATT TGGGGAGGGC GTGATCGGTC CGCTAGTTCG CCCTAATACC CCGGGATGTT CTCAGTGTGC TGACATGCGG CGCCTTATAG CGGGATACGA CCGCAAGGAA ATGTGGGAGC TGCAACAGGT GATGGCGGTG CAAGGGGGAA TACAGCATGA TGCATGGGCA TCACGAATAG GACTTTTGCA GATGGCTCAC CTGATTGTCG CGGAGACGCA GAGGGTGTTG GAAGGCAGTC ATACCCGTTT AGAAGAAAGG TTGTTCCTAA TCAACCTGAA AACATTAAAG AACTCATGTC ACTTCTTTCT GCCCGACCCG TTATGTACGG TATGCAGCCA ATTACCTGAC GATTCGCCGG CAGCAGCCCG CATATCGCTG CAACCAAGTC CGAAGATCAG CGCTGACAGT TACCGCTGCC GTCCGATCGA GGAGCTGAAA GAAGTTCTGA TCAAAGACTA TCTAGATTAT CGAACTGGAT TATTGAATGG TAAAATGCAT AATTTCGCGT TGCCGTTTGC GGATGTTGTT GTAAATATGC CGATGTTTAT AGGGGACGAG GGAGTAGCAG GCCGGACTCA TTCCTATGAG GTTAGCGAGT TAACCGCCAT TTTGGAGGGG TTAGAGAGAT ATTGTGGCAT CGAAGCTCGT GGCAAACGGA CAGTGATTCA TGACAGCTAC CGAAATTTGA AAGATCAAGC ACTCAACCCA GTAAAGGTAG GAGTGCATGC GAAGGAACAG TATGCGCGAC CTGATTTTCC GTTCAAACCG TTTCATCCGG ATCGTCCAAT GAATTGGGTA TGGGGCTATT CGCTTTTACA AGAGCGTCCG ATTTTGGTTC CAGAGTTGCT CGCATATTAC AGTTTGGGAG GTGGGGATGG CTTTGTCTAT GAAACTTCCA ACGGATGTGC ATTAGGCGGG AGTTTAGAAG AAGCGATTTT CCATGCCATT TTGGAGGTGG TGGAGCGCGA TTCATTCTTG ATGGCTTGGT ATGCGCAGCT GCCTCTTCCG CGTCTTGACC TTCGTTCGGC TAACGATAAA GAATTACAGT TGATGGTCGA TCGTGTACGT GCGGTGGCGG GATATGATCT GTATTTTTTC AACTCGACGA TGGAGCACGG AATTCCAAGC GTCTGGGCAG TGGCGAAAAA CAGAAAACAA AAGGGATTGA ATCTCATCTG TGCCGCTGGA TCTCATCCGG ACCCTATACG GGCGGTGAAA AGCTCGATTC ACGAGTTAGC AGGCATGATG CTTGTGCTTG ACGAGAAATT TGAGGCAAAC CGAAAGAAAT ATGAAAAAAT GTTGCATGAT CCGCTATTAG TGCGGCAGAT GGAAGACCAT GGCATGCTGT ACGGTTTGCC GGAAGCAGAG GAGCGCCTGC AATTTTTGTT GGATGATCAT CGTCCGTTGC GAACGTTTGA AGAGGAATTC AAGCAGCAAA CGAAGAATGC AGACTTGACG GATGACCTGC GGGTTATTCT TCAGAAGTTC CGACGATTGA ATCTTGAGGT AATTGTCGTG GACCAGACAA CACCTGTCAT CAAACGGAAT GGATTATATT GTGTGAAAGT ACTGATTCCG 
GGAATGTTAC CGATGACATT TGGGCATCAT CTTACCCGCG TGACAGGCCT GGAGAGGGTG CTCCGGGTAC CAATGGAACT CGGGTATACG ACAAAACCGC TCACGCTTGA ACAGCTTAAT CCACATCCCC ATCCGTTCCC ATAG
|
Protein sequence | MGARVLMVGE GVLADFVYEK LSVQYQVVRQ IDFEEGIPEE TGLALVLHDA WHPSVHHKAE EALRSSGIPW LRGFVSFGEG VIGPLVRPNT PGCSQCADMR RLIAGYDRKE MWELQQVMAV QGGIQHDAWA SRIGLLQMAH LIVAETQRVL EGSHTRLEER LFLINLKTLK NSCHFFLPDP LCTVCSQLPD DSPAAARISL QPSPKISADS YRCRPIEELK EVLIKDYLDY RTGLLNGKMH NFALPFADVV VNMPMFIGDE GVAGRTHSYE VSELTAILEG LERYCGIEAR GKRTVIHDSY RNLKDQALNP VKVGVHAKEQ YARPDFPFKP FHPDRPMNWV WGYSLLQERP ILVPELLAYY SLGGGDGFVY ETSNGCALGG SLEEAIFHAI LEVVERDSFL MAWYAQLPLP RLDLRSANDK ELQLMVDRVR AVAGYDLYFF NSTMEHGIPS VWAVAKNRKQ KGLNLICAAG SHPDPIRAVK SSIHELAGMM LVLDEKFEAN RKKYEKMLHD PLLVRQMEDH GMLYGLPEAE ERLQFLLDDH RPLRTFEEEF KQQTKNADLT DDLRVILQKF RRLNLEVIVV DQTTPVIKRN GLYCVKVLIP GMLPMTFGHH LTRVTGLERV LRVPMELGYT TKPLTLEQLN PHPHPFP
|
| |
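The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a short sketch. It is a hedged example rather than the database's own method: the helper names `clean`, `gc_content`, and `translate` are hypothetical, the codon table is built by hand from the NCBI translation table 11 layout instead of calling a library, and the first codon is translated as methionine whenever it is one of the table 11 start codons. The sequence blocks are assumed to be separated by whitespace only.

```python
# Codon table for NCBI translation table 11 (bacterial, archaeal and plant plastid).
# Amino acid assignments are identical to the standard code; only start codons differ.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate a CDS with table 11, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        codon = s[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence from the Sequence table.
print(translate("ATGGGTGCTC GCGTATTGAT GGTTGGAGAA"))  # MGARVLMVGE
```

Applied to the full gene sequence, `translate` would be expected to reproduce the listed protein sequence, and `gc_content` rounded to a whole number would be comparable to the GC content field.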