Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_3379 |
Symbol | |
ID | 7977135 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 3405888 |
End bp | 3407126 |
Gene Length | 1239 bp |
Protein Length | 412 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 644800146 |
Product | putative transcriptional regulator, PucR family |
Protein accession | YP_002951285 |
Protein GI | 239828661 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism [T] Signal transduction mechanisms |
COG ID | [COG2508] Regulator of polyketide synthase expression |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.670117 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTCATC ACGATGACCC ATTTCAAGGG AACTTCGACA GCCTCGAAGA ATTTGCGGAC CATATTAGCG ATCTGTTGCA ATGTCCGATT ACGATTGAAG ACGCAAACCA TCGGCTCATC GCCTATAGCG CTCACGATGA CTATACCGAT CCGGCGCGTA CCGCAACGAT TATCAGCCGG CGCGTACCCG AAAAAGTGAT TAATAGTTTA TGGAAGCAAG GGGCGATCCC GGCGCTATTA AGCAGTCGTG ACCCTGTCCG TGTCGCGGCC ATTTCCGAAG TCGGTCTCGG CAGTCGTGTC GCCGTCTCGA TTTGGAAAAA CGACGAAGTA ATCGGATTTA TTTGGGCATT GGAAACAGAC CGCACGTTAT CGAAAGAGGA TATGGAGTTG TTAAAAAAAG CAGCAAAAGC AGCGAAAAAT AAAGTATTGC AGCTGTATAT GCGCAAAAAT AAAAAAGAAG AACGGGTTCA GGAATTTTTT TGGAAGCTGT TGACAGGACA TATGACGACA GAGGAAGAAA TTAAAGAAAA TTTCGAGATG ATGCAAATCC CGACCGCTCC TTTATTTTCC GTCATCGTTT TTCGGCTTGC CAACGAAATT ACGAGAGAGA TTGAAAAACA AATTTCTTAT CTGTTACAGA CAACTCAACA AATTCAACTT CTTCTGTATA CGACAGACCG GAGTGATGTC ATTTTATTGG CGGCTCCGCA AACGACAACT CAACCGCTGC AAGAGTTTAA CTGCTTTATC CAATCATTCG CGACGAAAAT GAAGGAGCGA TTTCACATTA GCAGCATTCA AAGCGGCTTC GGCGGTATAT ACGAAACGTA TACATGCATC GAAAAAAGCT ACAAGGAAGC TTTAACAGTG CTGAAAATGA AAGAAAAGTT TCCGGCGCAA ATCAGCTCCA TTTATGGATA TCAACAATTG GGCATTTATC AATTTTTTGA TTTATTGTTA GAGAAAAAAC GACAAGGAGA GTTTACCAAC ACGGCTTTAC TCAAATTACA ATCGTATGAT CAAAAACACC ATAGCGATTT AGTCGAAACA TTTGAAATCT TTTTCGACCA CGACAGCAAT GTCAATGAAA CGGCGAAAGC GTTAAACATT CATCCAAACA CCCTTGCTTA CCGACTAAAA CGCATCGCCG AAATTGGTGA AATTGACTTA CATGATATCA ATCAAAAAGT AAAACTATAT ATCGATATTA AACTTGCCAA ATATGAAGCG CTCCATTAA
|
Protein sequence | MTHHDDPFQG NFDSLEEFAD HISDLLQCPI TIEDANHRLI AYSAHDDYTD PARTATIISR RVPEKVINSL WKQGAIPALL SSRDPVRVAA ISEVGLGSRV AVSIWKNDEV IGFIWALETD RTLSKEDMEL LKKAAKAAKN KVLQLYMRKN KKEERVQEFF WKLLTGHMTT EEEIKENFEM MQIPTAPLFS VIVFRLANEI TREIEKQISY LLQTTQQIQL LLYTTDRSDV ILLAAPQTTT QPLQEFNCFI QSFATKMKER FHISSIQSGF GGIYETYTCI EKSYKEALTV LKMKEKFPAQ ISSIYGYQQL GIYQFFDLLL EKKRQGEFTN TALLKLQSYD QKHHSDLVET FEIFFDHDSN VNETAKALNI HPNTLAYRLK RIAEIGEIDL HDINQKVKLY IDIKLAKYEA LH
|
| |