Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_0509 |
Symbol | |
ID | 7979388 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 579259 |
End bp | 580416 |
Gene Length | 1158 bp |
Protein Length | 385 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 644797511 |
Product | hypothetical protein |
Protein accession | YP_002948685 |
Protein GI | 239826061 |
COG category | [S] Function unknown |
COG ID | [COG2718] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR02877] sporulation protein YhbH |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00000608063 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGGAA ACTTTGTTAT ATCGAAAGAA GACTGGTCCC TCCATCGGAA AGGACATGAC GATCAAAAAC GACATGAAGA AAAAGTAAAA GAAGCAATTA AAAACAATTT GCCAGATTTA ATTACAGAAG AAAGCATCAT TATGTCAAAC GGACGTGACG TCATTAAAAT TCCGATCCGT TCTTTGGATG AATATAAAAT TCGTTATAAT TATGAGAAAA ACAAACATGT TGGTCAAGGG AACGGAGACA GTCAAGTTGG CGATGTTGTT GCAAGAGATG GATCAGGGGA GGGACAAGGG CCTGGAACAG GGCAAGGAGC CGGCGACTTG CCCGGGCAAG ATTATTATGA AGCGGAAGTT TCGCTAATGG AATTGGAAGA AGCGTTATTC AGCCAATTGG AGCTGCCGAA TTTACAAAGA AAAGAAGCGG ACCAAAACGT TGTAGAACAT ATTGAATTTA ATGATATTCG CCGTACTGGT TTAACGGGGA ATATCGATAA AAAAAGAACG ATGCTCGCGG CATTTAAGAG AAATGCGATG AATGGCAATC CAAGTTTTTA TCCGATTTAT CGTGAAGATT TAAGATATAG AACATGGAAT GAAGTGGTAA AGCCAGATTC GAAAGCAGTT GTTCTTGCGA TGATGGATAC AAGCGGCTCA ATGGGATTGT GGGAAAAATA TATGGCACGC AGTTTCTTCT TTTGGATGAC CCGCTTTTTA CGTACAAAAT ATGAAACGGT GGAAATCGCA TTTATCGCCC ATCATACAGA AGCAAAAGTG GTTACCGAAG ATGAATTTTT CACAAAAGGG GAAAGCGGAG GCACGATCTG TTCTTCCGCC TATCGAAAAG CGTTGGAGCT CATTGAAACG AAATACTCGC CATCGCGTTA TAATATTTAT CCGTTCCACT TTTCGGATGG AGATAACTTG ACATCAGATA ACGCCCGTTG TGTCAAACTT GTTCAAGAAC TAATGAAAGT GTCCAACATG TTTGGATATG GAGAAGTAAA TCAGTATAAC CGCCACTCGA CGCTAATGTC TGCTTACAAA AACATCAAAG ATGAAAAATT CCGTTATTAT ATCCTCAAAC AAAAATCAGA TGTATTCCAT GCGATGAAAA CATTTTTCCG GAAAGAAGAA AATAAGGCAT TCGTATGA
|
Protein sequence | MKGNFVISKE DWSLHRKGHD DQKRHEEKVK EAIKNNLPDL ITEESIIMSN GRDVIKIPIR SLDEYKIRYN YEKNKHVGQG NGDSQVGDVV ARDGSGEGQG PGTGQGAGDL PGQDYYEAEV SLMELEEALF SQLELPNLQR KEADQNVVEH IEFNDIRRTG LTGNIDKKRT MLAAFKRNAM NGNPSFYPIY REDLRYRTWN EVVKPDSKAV VLAMMDTSGS MGLWEKYMAR SFFFWMTRFL RTKYETVEIA FIAHHTEAKV VTEDEFFTKG ESGGTICSSA YRKALELIET KYSPSRYNIY PFHFSDGDNL TSDNARCVKL VQELMKVSNM FGYGEVNQYN RHSTLMSAYK NIKDEKFRYY ILKQKSDVFH AMKTFFRKEE NKAFV
|
| |