Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_0699 |
Symbol | |
ID | 7979479 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 769369 |
End bp | 770394 |
Gene Length | 1026 bp |
Protein Length | 341 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 644797683 |
Product | transcriptional regulator, LacI family |
Protein accession | YP_002948857 |
Protein GI | 239826233 |
COG category | [K] Transcription |
COG ID | [COG1609] Transcriptional regulators |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTGTCA CGATCAAAGA TGTCGCGAAA CGAGCAAACG TGGCTCCCTC CACGGTTTCC CGTGTTATCG CCGACAGTCC GCGCATCAGC GAGAAAACGA AGCGGAAAGT GCGCGAGGCG ATGAAAGAAC TTGGATATCA CCCAAATTTT ATTGCGCGCA GCTTGGCAAA CCAAGCGACG CAAGTCATCG GCATCGTCAT GCCGAGCGCG GCGGAACAGG CGCTGCAAAA CCCGTTCTTT CCAGAAGTCA TCCGCGGCAT CAGCAAGGCG GCGCATGAAA AAAAATACGC GCTGCAAATG TCCACGGGCG AAAAAGAAGT GGAAATTTAT GAAGGGGTCG TCGACATGCT GCAAGGCCGC CGCGTAGACG GGGTGATTTT ATTATACTCG CGCATTGATG ATAAACTGAT GAAATATTTG CAAAAAAACA AATTCCCATT TGTCGTGATT GGGAAGCCGC ATCAAAAGGC GGAACAAATC ACTCATGTCG ATAATGACAA TTACCTGGCG GGCAAAGAGG CGACCGAGTA TTTGATTGCG CGCGGCCATG AACGAATCGC TTTTGTCGGC GGCAATAAGC AATATTTGGT CACCGTTGAC CGACTGAGCG GGTATGAAGC GGCGTTGAAA GAAGCTGGCC TTCCTTATCG TGAAGACTAT ATCATGCATG AAGAATTTCT GCAGGAAGGC GGCCAAGAGG CGATGAAAGA GCTGCTTTCG CTCGCAGACC CGCCGACCGC GCTTGTCGTC GCCGACGACT TAATGGCCTT AGGCGTGCTG AAGACGTTGG ATGAAATGAA CCTGCGCGTT CCCGATGACA TTTCGATCGT CAGCTTCAAC AACACGCTGC TGGCGGAAAT GTCGCGCCCG CCGCTGACAT CGGTGGACAT CCACATTTTC CAGCTCGGCT ATGAAGCAAC GAAAAGTTTA ATCGAAAAAA TCGGCAACCC AAATGAACCA GTCAAGCGCA TTATTATTCC GCATCGCATT GTTGAGCGTT TTTCGTGCAA CGATGGCAGG AAATAA
|
Protein sequence | MTVTIKDVAK RANVAPSTVS RVIADSPRIS EKTKRKVREA MKELGYHPNF IARSLANQAT QVIGIVMPSA AEQALQNPFF PEVIRGISKA AHEKKYALQM STGEKEVEIY EGVVDMLQGR RVDGVILLYS RIDDKLMKYL QKNKFPFVVI GKPHQKAEQI THVDNDNYLA GKEATEYLIA RGHERIAFVG GNKQYLVTVD RLSGYEAALK EAGLPYREDY IMHEEFLQEG GQEAMKELLS LADPPTALVV ADDLMALGVL KTLDEMNLRV PDDISIVSFN NTLLAEMSRP PLTSVDIHIF QLGYEATKSL IEKIGNPNEP VKRIIIPHRI VERFSCNDGR K
|
| |