Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_2963 |
Symbol | |
ID | 7977262 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 2986030 |
End bp | 2987049 |
Gene Length | 1020 bp |
Protein Length | 339 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 644799762 |
Product | transcriptional regulator, DeoR family |
Protein accession | YP_002950902 |
Protein GI | 239828278 |
COG category | [K] Transcription |
COG ID | [COG2390] Transcriptional regulator, contains sigma factor-related N-terminal domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00000456437 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAATCAT TAATTGAAGC GCAAAAAAAA TTATTGCCCG ACCTTCTTGA AGTTATGCAA AAAAGATATC GTATTTTGCA CTACATTTCG CTGATGCAGC CAATCGGGAG GAGAGTACTG TCGGGCAGTT TAGGAATAAC TGAGCGTGTG CTTCGTTCGG AAGTGCAGTT TTTGAAAGAG CAAAATTTGC TAACCATCAG TTCCGCCGGC ATGAGTTTGA CCCAAGAAGG CAAGGAATTG TTGCATGCGC TCGAGGATGT TATGAGAGAA TTATTAGGCC TAAAAGATTT AGAAACAAAA TTAATGGAGA AACTTCATAT TGAAGGGGTT ATCGTTGTTG CTGGAGATAG TGACCTTTCT CCTTGGGTAA AAAAAGAAAT GGGAAGAGCT TGTGTGACCT GCATGAAAGA AAGATTGACA AACGGCGACA TCGTGGCAGT GACAGGAGGC ACAACACTTG CCGCTGTTGC CGAGATGATG ACTCCCGATC CAAAACAAAA CCGTATTTTA TTTGTTCCGG CACGGGGCGG CTTAGGTGAA GATGTGAAAA ACCAAGCGAA CACCATTTGT GCAAAGATGG CGGAAAAAGC GTTAGGAAAC TATCGGCTTC TTCATGTTCC TGACCACCTA AGTCGTGAGG CGTATGAATC TTTAATTGAA GAACCGACTG TGAAAGAAGT GCTGGAACTT ATTAAATCAT GCCGCATGGT CGTGCATGGA ATCGGAGATG CTATCACAAT GGCGGAACGC CGGAAAACCT CAAAAGAGGA TATGGAGAAA ATTAAAGAAC GACATGCGGT TGCCGAAGCG TTTGGCTATT ACTTTAATGA AGCAGGAGAA GTTGTGCATA AAGTGAAAAC GGTTGGCATT CAGTTGGAAG ACCTTCCGCG TGTCGAACAT GTCATTGCGG TGGCTGGAGG GTCTTCCAAG GCAAAAGCCA TTCAGGCGTA TATGAAACAA GCGCCTCATT CCATTCTTAT TACCGATGAA GGTGCAGCAA GAGCGTTACT AGGGGAGTAA
|
Protein sequence | MQSLIEAQKK LLPDLLEVMQ KRYRILHYIS LMQPIGRRVL SGSLGITERV LRSEVQFLKE QNLLTISSAG MSLTQEGKEL LHALEDVMRE LLGLKDLETK LMEKLHIEGV IVVAGDSDLS PWVKKEMGRA CVTCMKERLT NGDIVAVTGG TTLAAVAEMM TPDPKQNRIL FVPARGGLGE DVKNQANTIC AKMAEKALGN YRLLHVPDHL SREAYESLIE EPTVKEVLEL IKSCRMVVHG IGDAITMAER RKTSKEDMEK IKERHAVAEA FGYYFNEAGE VVHKVKTVGI QLEDLPRVEH VIAVAGGSSK AKAIQAYMKQ APHSILITDE GAARALLGE
|
| |