Gene Information |
Locus tag | GWCH70_2965 |
Symbol | |
ID | 7977264 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 2987425 |
End bp | 2988738 |
Gene Length | 1314 bp |
Protein Length | 437 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 644799764 |
Product | RNA polymerase factor sigma-54 |
Protein accession | YP_002950904 |
Protein GI | 239828280 |
COG category | [K] Transcription |
COG ID | [COG1508] DNA-directed RNA polymerase specialized sigma subunit, sigma54 homolog |
TIGRFAM ID | [TIGR02395] RNA polymerase sigma-54 factor |
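The coordinate and length fields above are related by simple arithmetic, which the short sketch below checks. It assumes that the coordinates are 1-based and inclusive and that the gene length includes one untranslated stop codon; both conventions are inferred from the values in this record rather than stated by the source. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 2987425
end_bp = 2988738
gene_length_bp = 1314
protein_length_aa = 437

# Assumption: 1-based, inclusive coordinates, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with a single stop codon that is not translated.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # span agrees with Gene Length
print(span_bp % 3 == 0)                          # CDS is a whole number of codons
print(expected_protein_aa == protein_length_aa)  # agrees with Protein Length
```

Under these assumptions the three fields agree: 2988738 - 2987425 + 1 = 1314 bp, which is 438 codons, or 437 residues plus the stop codon.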
| |
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.000000225777 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGGGCCG AATTATTGCA GGAGCAGCGA CTGAAGCTAT CGCTTACAAA GGAATTGACT CAAGCGATTG AACTATTGCA ATATTCTGCT GTGGAATTAC AGTCCTTTTT GTATGAGCAG TCGTTGGAAA ATCCTTTTCT CGAAATTCGC GACTACCGGT TAAAGCGAAA CTTCCGCAAC TTATCGGATA AGGAAAAACA GCAGTGGATT GAAAATATGA GCACTTATTC GGAAACGCTT TCATCATATT TAACTGCACA GCTTCCAGCT CTTTCGCTTT CGGAACACGA AGAACGCATC GTGCATTATA TGATTGCTTG CCTTGATGAA GATGGGTATT TGCGGGTAAA TATCGAAGAA ATTGCAGAAC GATTTGCTAT TTCCAAACAG GAAGCAGAAA AGGCGCTTCA GATCATTCAG TCATTAGAGC CGGCTGGCGT TGGCGCCCGC AACTTGCAAG AATGTTTATA TTTACAGTTA AAGCGCTTAC CATATCGCGA TGAATTCGCG GAACAGATTA TACAGCACCA TTTTTCGTTA TTTGTCGAAA AAGCTTGGAA AACGTTGGCG AAACAACTTG GCGTGGATAT CGCTTCATTG CAGCGTGTAT GGGATTTAAT TCGCACGCTT GAACCGCGTC CTGGCATCCA TTATACAAAA GAGAGACCGC ATTTTATTGT TCCCGATATA ATTGTCCAAC GCAGCGAAGA AGGTGATTGG CGCATTTTTT ACAATGAAGA TGTGCATCCG GAACTCATCT GGAATCGGGG ATACGAGCAA AAGATTTCCA GCTGTCAAGA CGGGCAGGTT CACGCGTTTG TAAAAGATAA ATATCGCCAG TTTTTATGGT TAGCAAAAAG TCTGGAACAG CGCAAACAAA CATTGCTGAA CATTATGCAT GTCATTGTCG ATAAACAAAA ACAATGTTTT GAAACTGGTT TTGCAGCCTT AAAGCCCCTT ACAATGCGCG AAGTTGCGGA AGAGCTTGGC ATTCACGAAT CTACTGTCAG CCGCGCGGTA AAAAACAAAT ATGTTCAAGC GCCGTTTGGC ACAGTGGAAC TTCGCCGCTT TTTTTCAAGC GCTGTTTCTT CTGTTTACAT GGATGAAGAT GCTGCCTCTT CTGTAAAAGT AAAAATGTTT ATCAGGCAGT TAATTGAACA GGAAAATAAG CAGGAGCCGC TTTCTGACCA AAAGCTCGCC GATTTATTAC ATGAGCAATA CGGTGTGGTG ATCTCGCGCA GAACGGTCGC AAAATATCGC GAACAACTGC ATATTCCATC ATCTGCAAAA CGAAAACAGT ATGTAGGGAA GTGA
Protein sequence | MRAELLQEQR LKLSLTKELT QAIELLQYSA VELQSFLYEQ SLENPFLEIR DYRLKRNFRN LSDKEKQQWI ENMSTYSETL SSYLTAQLPA LSLSEHEERI VHYMIACLDE DGYLRVNIEE IAERFAISKQ EAEKALQIIQ SLEPAGVGAR NLQECLYLQL KRLPYRDEFA EQIIQHHFSL FVEKAWKTLA KQLGVDIASL QRVWDLIRTL EPRPGIHYTK ERPHFIVPDI IVQRSEEGDW RIFYNEDVHP ELIWNRGYEQ KISSCQDGQV HAFVKDKYRQ FLWLAKSLEQ RKQTLLNIMH VIVDKQKQCF ETGFAALKPL TMREVAEELG IHESTVSRAV KNKYVQAPFG TVELRRFFSS AVSSVYMDED AASSVKVKMF IRQLIEQENK QEPLSDQKLA DLLHEQYGVV ISRRTVAKYR EQLHIPSSAK RKQYVGK
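The relationship between the two sequence fields can be spot-checked with a small translation sketch. It rests on two assumptions: that the gene sequence is already listed in coding orientation even though the record gives the strand as "-" (it begins with ATG and ends with TGA, which is consistent with this), and that the codon-to-amino-acid assignments of translation table 11 match those of the standard code, with the tables differing only in permitted start codons. The `translate` helper and the constant names are illustrative, not part of any database tooling.

```python
from itertools import product

# Standard codon assignments in TCAG order; assumed here to apply to
# translation table 11 as well (the tables differ in start codons only).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 24 bases of the gene sequence, copied from the record.
first_bases = "ATGAGGGCCG AATTATTGCA GGAGCAGCGA"
last_bases = "CGAAAACAGT ATGTAGGGAA GTGA"

print(translate(first_bases))
print(translate(last_bases))
```

The two fragments translate to MRAELLQEQR and RKQYVGK, matching the first ten and last seven residues of the protein sequence listed above. Passing the full gene sequence to the same function would be the natural way to check the whole record.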