Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_2273 |
Symbol | |
ID | 3831384 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 2380468 |
End bp | 2381469 |
Gene Length | 1002 bp |
Protein Length | 333 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637830193 |
Product | LacI family transcription regulator |
Protein accession | YP_431103 |
Protein GI | 83591094 |
COG category | [K] Transcription |
COG ID | [COG1609] Transcriptional regulators |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.0311079 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.000018406 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGGTCACTA TTAAAGACGT AGCCAGACAT GCCGGGGTGT CGGTTACTAC GGTCTCCCGG GTCCTCAACA ACAGCCAGCA TCCTATCAGT CCTGCTACCA AACAGCGCGT TCTAGCAGCC ATCGAAGAAC TCGGCTTTTG CCCCAACGCC GCGGCCCGCA GCCTGCAGCT CAATGAAACC AGGACCATTG GCCTTATCCT GCCGGATATC GCCAACCCCT ACTACCCCGG CATCGTCCGG GGCGTCGAGG ACGTGGCCCA TGAATCCGGT TACACGGTTA TCCTCTGCAA TACCGACCGT TCCCGTGAAC GTACTCAGGA ATACTTAAGA GTGCTACGGG AAAAGCGGGT GGACGGAGTA ATCTTTACCG GCGGCGGGGC CGTGGAAGAC GCCAGCCAGA GCCACTTTTT CGACCAGGAA AGAATAGCTA CCGTGGTTAT CGGACGTCAC CGTGGCAAAC TGCCGGCGGT GCAGGTGAAT AATACCCTGG CGGCACGGGA GGCGGTAGAG CATCTGCTAT CCCTGGGACA CAGGCGTATC GCAACTATCA CTGGACCGGC GACCTCAACT ACAGCCAGTG ATCGCCTGGA TGGTTACCGG TGTGCCCTGG CCGGGCGAGG TATGAAAGTA GACCCCCTTT TAATCGTTGA AGGTAATTTT GAATTCGAGA GCGGGTACCA GGCTATTGAC CGGCTGCCCC TCAGGGGCCC CGGGGCTATA ACGGCTATCT TTGCCCATAA CGATTTGATG GCTATCGGGG CTATGAAGGC CCTTCAGGAA CGGGGCTTAC AGGTTCCCGG CGATATAGCA GTTATGGGCT TTGACAATAT TCCCCTCGCT TCCTTTATCA CCCCCCAGCT TTCTACCGTT GCTGTTCCTG TCTATGACCT GGGGGTAACG GCCATGAAGG TGCTGGCTGA GCTCCTCGCC GGCCGGGAGG TGCCGCCGGT CACCACCCTG GCTACCAAGC TCCAGGTCCG GGACTCTACT ATAATTAAAT AA
|
Protein sequence | MVTIKDVARH AGVSVTTVSR VLNNSQHPIS PATKQRVLAA IEELGFCPNA AARSLQLNET RTIGLILPDI ANPYYPGIVR GVEDVAHESG YTVILCNTDR SRERTQEYLR VLREKRVDGV IFTGGGAVED ASQSHFFDQE RIATVVIGRH RGKLPAVQVN NTLAAREAVE HLLSLGHRRI ATITGPATST TASDRLDGYR CALAGRGMKV DPLLIVEGNF EFESGYQAID RLPLRGPGAI TAIFAHNDLM AIGAMKALQE RGLQVPGDIA VMGFDNIPLA SFITPQLSTV AVPVYDLGVT AMKVLAELLA GREVPPVTTL ATKLQVRDST IIK
|
| |