Gene Information |
Locus tag | GWCH70_0616 |
Symbol | |
ID | 7978805 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 681160 |
End bp | 682428 |
Gene Length | 1269 bp |
Protein Length | 422 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 644797603 |
Product | protein of unknown function DUF21 |
Protein accession | YP_002948777 |
Protein GI | 239826153 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
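The length fields in the table above follow from the coordinates. A minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon (neither is stated in the record itself):

```python
# Values copied from the Gene Information table.
start_bp = 681160
end_bp = 682428

# Assumption: coordinates are inclusive on both ends, hence the +1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted in the protein.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)
```

Under these assumptions the results agree with the listed Gene Length (1269 bp) and Protein Length (422 aa).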
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0000000839014 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGGGAGAGT TGCCTCTGAG TCTGTTAGGG TGGTTTTTTC TTTGTATCGT GTTAGTTGCC TTCTTTTCTT CAGTGGAAGC CGCATTTTCT TCAGCGAATA AAATTCGCTT GAAAAATTAT GTGGAAGAAA ATCACCGTGG CAGCAAGCGA GTAAATTACA TTATGGAAAA CTTAGATCGC GTCTTATTGA CGGTGCTTGT TGCAAACAGA GTCACAAGTA TTGTTGCCGT TGCGTTTTTA GCAGATATTG CGACAACGAT GCTTGGGGAG CGTGCAGGTC TTATTGCTGC GATTATTGTC ATGACCGTGC TTCTTTTAAT CTTCGGTGAA ATTTTGCCAA AATCGATTGC GAAAGAGCAT GCGGAATCGC TATCGATTCG TTATGCCGGA ATTGTTTACG CATTGATGAA ACTGCTTTCA CCCATCACGA CATTATTTAA CGCTGTAAAA GAGAGCGTAG CGAAACGGTT TACGAATGGA ACGGTTGTTC CAGCAGTAAC GGAAGAGGAA ATTAAAGTGA TGATTGATTT AAGTGAAGAA GAAGGCATTA TTGACAACAA AGAAAAAGAA TTAATTCACC GTTCGCTCGA TTTTGATGAA ATTTTAGTTG GAGAAATTTT TACGCCACGG TCAGATATGG TCGCTGTGGA AGTAAATCAG CCGATCGAAG CAATTCGCGA TGTTTTTCTT GAAGAAAAGT ACTCCCGTAT ACCGGTGTAT GAGGAAGATA TTGATAATGT GATTGGTATT TTATCTGAAA GGGACTTTTT TAGTGAGCTT GTACAACAAA AAGATATAAA CATTCGCGCG TTATTACGCA AGCCGCTGTT TGTCGTGGAA TCAATGAAAA TTTCTGATTT GTTGCCGGAA CTTCAAAAAA GCAGAGTGCA TATGGCAATT GTCATTGATG AGTTTGGAGG AACAGCAGGA TTAATTACGC TTGAAGACAT TATTGAACAA ATTGTTGGAG AAATATGGGA TGAGCATGAT GAAGCGGTAA AAAATATTCA ACAAATCGAT GAAAACAGTT ATGAATTTAA CGCTGAACTT CCGCTTGATG AATTTTGCGA AATAATGAAA ATTGATGCAC CGGAAAGCTC TTCCCATACG TTAGGCGGTT GGATATTTGA AATGTTTGAA CGCGTGCCGA ATGTCGGCGA AACGCTGCAT TATGGTCCGC TTACTTTAAC CGTACGACAA GTCGAAAATC GCAGAATTAG GAAAGTACTT GTTTCATTAA ATGAGCCGCC GTTAGTGGAA AATATGTAA
|
Protein sequence | MGELPLSLLG WFFLCIVLVA FFSSVEAAFS SANKIRLKNY VEENHRGSKR VNYIMENLDR VLLTVLVANR VTSIVAVAFL ADIATTMLGE RAGLIAAIIV MTVLLLIFGE ILPKSIAKEH AESLSIRYAG IVYALMKLLS PITTLFNAVK ESVAKRFTNG TVVPAVTEEE IKVMIDLSEE EGIIDNKEKE LIHRSLDFDE ILVGEIFTPR SDMVAVEVNQ PIEAIRDVFL EEKYSRIPVY EEDIDNVIGI LSERDFFSEL VQQKDINIRA LLRKPLFVVE SMKISDLLPE LQKSRVHMAI VIDEFGGTAG LITLEDIIEQ IVGEIWDEHD EAVKNIQQID ENSYEFNAEL PLDEFCEIMK IDAPESSSHT LGGWIFEMFE RVPNVGETLH YGPLTLTVRQ VENRRIRKVL VSLNEPPLVE NM
|
| |
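The gene sequence begins with TTG while the protein begins with M. This is consistent with translation table 11, in which TTG is among the permitted initiation codons and an initiator is read as methionine. The sketch below illustrates this on the first three and the last 10-base blocks of the gene sequence. The function name `translate_cds` and the constants are hypothetical helpers written for this illustration, not part of the database; the codon table is built by hand in the NCBI T, C, A, G order.

```python
from itertools import product

# Amino acids for the 64 codons in NCBI order (bases T, C, A, G).
# Table 11 uses the same assignments as the standard code; only the
# set of start codons differs.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace between blocks.

    Hypothetical helper: the first codon is read as M when it is a
    table 11 start codon, and translation stops at the first stop codon.
    """
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    protein = []
    for index in range(0, usable, 3):
        codon = seq[index:index + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if index == 0 and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


# First three blocks and the final block of the gene sequence above.
first_blocks = "TTGGGAGAGT TGCCTCTGAG TCTGTTAGGG"
last_block = "AATATGTAA"

print(translate_cds(first_blocks))  # MGELPLSLLG
print(translate_cds(last_block))    # NM
```

The two outputs match the first block (MGELPLSLLG) and the last two residues (NM) of the protein sequence in the record.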