Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_0538 |
Symbol | |
ID | 7979393 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 606629 |
End bp | 608497 |
Gene Length | 1869 bp |
Protein Length | 622 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 644797534 |
Product | putative transcriptional regulator |
Protein accession | YP_002948708 |
Protein GI | 239826084 |
COG category | [K] Transcription |
COG ID | [COG2865] Predicted transcriptional regulator containing an HTH domain and an uncharacterized domain shared with the mammalian protein Schlafen |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.141611 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACGCGA AGGATTTGTT GCGTATTATA GAAGATGGAG AAAATGCCGA ATTGGAATGT AAAGCAGCCA AAGGCGGTTT ACCGAAAGAT GTTTGGGAAA CGTATTCTGC TTTTGCGAAT ACAAATGGCG GTATTATTTT GCTTGGGGTC GAGCAAAAAG GACAAGAGTT TTTCCCAGTC GATGTTGATG CAAAAAAGTT AGTCAAAGAC TTTTGGGATG GAATTAATAA TCCCCAAAGG ATTAGTAAAA ATATCTTAGT GGATCGAAAT GTAGAAATTA TTGAAGTTGA TGGGAAAGAA ATCGTGAAAA TTTATGTTCC AAGAGCTTCA CGTGAGGACA GGCCAATTTA TATTGGTCAT AATCCTTTTA CAGGTACATA TCGCCGCAAT TATGAAGGAG ACTATAAATG TACAAAAGAA GAAGTACAGC GAATGATAGC GGACCAATCG CCAATTCCTC AAGACAGCAA AGTTCTTCCC AATTACGGTT TAAATGATTT GAATATGGAA TCGGTGAGAA GTTATCGAAA TCGCTTTTCT TCTTTGAAAC CAGATCACCC TTGGAATGGT TTAGAACTGA AGGAATTTTT ATATAAAATT GGAGCTTTGG GAAAGCTGCG CGAAACTAAC GAGGAAGGGC TAACATTAGC CGGGTTGCTC ATGTTTGGGG AAGAGCGCAG CATTACAGAG TATTTACCGC AGTATTTCTT AGAATATCGG GAAAAGAGTT TGGATGTGCC GGGGGAGAGA TGGACAGATC GGATTATTTC TTCCGACGGC ACTTGGTCGG GGAATTTGTA TGATTTTTAT TTTAAAGTGA TCCGCAGACT AACGGATGAT TTAAACATTC CTTTTCAAAT GGAGGGGCTT TTTCGCAAGG ATGATACAAG GGTGCACGCT GCTCTTCGTG AGGCTTTAGT CAATACGTTG GCTCATGCCG ATTACTACGG GCAACGCGGA ATTGTTATCG AAAAGGAAAA GATGTTATTT CGATTTTCCA ACCCGGGTGT TCTTCGGATT CCTTTGCAAC AAGCACTGAA AGGTGGAATA AGTGATCCGA GAAATCCAAC GATTTTTAAA ATGTTTATGT TAATCGGACT TGGTGAACGG GCAGGGTCGG GGATTGAGAA TATTCATTTG GCGTGGAAAG AACAACATTG GACTGCCCCA GAATTAATTG AAGAATTTCA GCCGGATCGC ACGATTCTTA CGTTGCGAAC AACTTCATTA TTACCGCAGG AAAGCGTCGA TTTTCTAAAA GCAGTTCTTG GGAAACATTA TAAGTTTTTA ACCAATGATG AAGTATTAAT TTTAGTGACG GCACATCAAG AAAATTATGT GACCAATACG AGATTGCAAA GCCTGACAGA TAAAAAGAGT GATGAAATAA GCAAATTATT ATCCGTACTG GTGGAAAAAG GATATTTAGA ACCAAATGGA CAAGGAAGAG GTACAAAATA TACGCTAACG GAAATGTTTT ATCAAAAGTC TATAGGGAAC TCCAGATATA ATCGAATAAG CTCCGGACCT AGTGCTAATA ACTCCGGACC TAACGCTGCT AACTCCGGAC CTAACGAAAA CGAACAAACA AAAGATCATG AAAAAGTGCT ATTGGATATT TCTGAATTAG CAAGGAAGAA AAAGAGATTA CATCCTTCCG AAATGGATGA GATCATTTTA AATCTTTGTG CAATTAAGCC GTTAACGCTC AAAGAATTAA GTCAATTACT GAATAGGCAG ATGGATCCTC TTCGCAAAAA GTACATTTCA CGATTGCTTA GGGAAGGAAA GTTAGAACTG CTATATCCGG AGCAAGTAAA TCATCCGAAA CAAGCATATA TGACACGATC TCTTTTTAAC CAGCATTAG
|
Protein sequence | MDAKDLLRII EDGENAELEC KAAKGGLPKD VWETYSAFAN TNGGIILLGV EQKGQEFFPV DVDAKKLVKD FWDGINNPQR ISKNILVDRN VEIIEVDGKE IVKIYVPRAS REDRPIYIGH NPFTGTYRRN YEGDYKCTKE EVQRMIADQS PIPQDSKVLP NYGLNDLNME SVRSYRNRFS SLKPDHPWNG LELKEFLYKI GALGKLRETN EEGLTLAGLL MFGEERSITE YLPQYFLEYR EKSLDVPGER WTDRIISSDG TWSGNLYDFY FKVIRRLTDD LNIPFQMEGL FRKDDTRVHA ALREALVNTL AHADYYGQRG IVIEKEKMLF RFSNPGVLRI PLQQALKGGI SDPRNPTIFK MFMLIGLGER AGSGIENIHL AWKEQHWTAP ELIEEFQPDR TILTLRTTSL LPQESVDFLK AVLGKHYKFL TNDEVLILVT AHQENYVTNT RLQSLTDKKS DEISKLLSVL VEKGYLEPNG QGRGTKYTLT EMFYQKSIGN SRYNRISSGP SANNSGPNAA NSGPNENEQT KDHEKVLLDI SELARKKKRL HPSEMDEIIL NLCAIKPLTL KELSQLLNRQ MDPLRKKYIS RLLREGKLEL LYPEQVNHPK QAYMTRSLFN QH
|
| |