Gene Information |
Locus tag | Moth_2024 |
Symbol | |
ID | 3831399 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 2112161 |
End bp | 2113207 |
Gene Length | 1047 bp |
Protein Length | 348 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 637829953 |
Product | LacI family transcription regulator |
Protein accession | YP_430863 |
Protein GI | 83590854 |
COG category | [K] Transcription |
COG ID | [COG1609] Transcriptional regulators |
TIGRFAM ID | |
|
|
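The coordinate and length fields above are mutually consistent, which can be checked with a small sketch. It assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the function names are hypothetical and not part of the source database.

```python
# Coordinates copied from the record above (assumed 1-based, inclusive).
START_BP = 2112161
END_BP = 2113207


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of an inclusive coordinate range."""
    return end_bp - start_bp + 1


def protein_length(length_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the terminal stop codon


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1047 348
```

Both values agree with the Gene Length (1047 bp) and Protein Length (348 aa) fields.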
Plasmid Coverage information |
Num covering plasmid clones | 47 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCCAAATA TCTCTGATGT TGCCAGAAGA GCCGGAGTAT CACGAACTAC GGTATCCAGG GTCCTCAACG GGAAAGACGA CGTCAACGAA GAAACCCGGC GCCGGGTACT GGAGGCCATT AAGGAGCTAA ACTACCGCCC CAGTGCCCTG GCCCGCAGCC TGGTGAAACA AAAGACCGAT ACCATTGGCG TTATCCTGTC GGATATCACC GATCCCTTCT TTTCCCTTAT TATCCAGGGG GTGGAAGATG TAGCTCATAA ATTCGGTTAC GGCATAGTTT ACGCCAGTAT GCGCTGGGAC CCGCAAATCA AGCATAACTA TGTAAGTTTC TTGCGCAACG GGCGGGTGGA CGGTCTCCTT ATGATGGGCC ATACAGTGGG CAATGAAGAC TATGTACGGG AGATGGTAGA GGATAAGTTT CCCCTGGTCC TGGTTGAATA TTGGATTGAG AATCTCAAGG CCAACTTTAT ATCCATTGAT AACAAAGGGG GGGGTTACCT GGCCACCAGG CACCTGCTGG GACTTGGGCA CCGCCGCATT GCCCATGTAG CAGGGCATAA AAACGCCCGG GTATCGCAAG AACGCCTGGC CGGTTACCGC CAGGCCCTGG AGGAAGCCGG CAGGCCCTAC GACGAAAGCC TGGTAGTTTA CAGCGATTTT ACCACCGAAG GAGCCATACC GGTTGCGAAA AAACTATTAT CCCTGCCTGA GCGGCCGACA GCCATCTTTG CGGCCAACGA CCTGATGGCC TACGGCGTCA TCCATGCGGC CCGCGAACTG GGATTAAGGG TTCCAAAGGA TCTGGCGGTG GTCGGCTACG ACGACATCGA GCTGGCTTCC CTAGTAACAC CACCCTTAAC AACCATCCAT CAGCCCCGCT ATGAAATCGG CTCCATGGCC GCCTGGTCCC TGATCCAGCA GATCGAGAAT AAAGAAATGC AGCCGACGGT AACAGAGTTT AAGACGAGCC TGGTAATTCG GGAATCCTGC GGCGCTGTTT TCCGGCGCGG CGACCAGGGG TATGTAGAAG GAGGCAAGCC GGCATGA
|
Protein sequence | MPNISDVARR AGVSRTTVSR VLNGKDDVNE ETRRRVLEAI KELNYRPSAL ARSLVKQKTD TIGVILSDIT DPFFSLIIQG VEDVAHKFGY GIVYASMRWD PQIKHNYVSF LRNGRVDGLL MMGHTVGNED YVREMVEDKF PLVLVEYWIE NLKANFISID NKGGGYLATR HLLGLGHRRI AHVAGHKNAR VSQERLAGYR QALEEAGRPY DESLVVYSDF TTEGAIPVAK KLLSLPERPT AIFAANDLMA YGVIHAAREL GLRVPKDLAV VGYDDIELAS LVTPPLTTIH QPRYEIGSMA AWSLIQQIEN KEMQPTVTEF KTSLVIRESC GAVFRRGDQG YVEGGKPA
|
| |
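The relationship between the gene and protein sequences can be illustrated with a minimal translation sketch. It assumes the gene sequence is listed in coding orientation (even though the gene lies on the minus strand of the replicon) and that, under translation table 11, an alternative initiator codon such as TTG is read as methionine at the first position. The set of start codons below reflects my understanding of NCBI table 11 and should be checked against the official table; `translate` and the constant names are hypothetical helpers, and only the first 30 and last 27 nucleotides of the gene are used.

```python
# Standard codon assignments, with bases ordered T, C, A, G.
# Table 11 is assumed to share these assignments and to differ
# only in which codons may act as initiators.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumed initiator codons for table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(seq: str, starts_at_initiator: bool = True) -> str:
    """Translate a DNA string; whitespace between blocks is ignored."""
    seq = "".join(seq.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    residues = [CODON_TABLE[codon] for codon in codons]
    # An initiator codon is read as Met regardless of its usual meaning.
    if starts_at_initiator and codons and codons[0] in START_CODONS:
        residues[0] = "M"
    return "".join(residues)


# First 30 nt and last 27 nt of the gene sequence in the record.
HEAD = "TTGCCAAATA TCTCTGATGT TGCCAGAAGA"
TAIL = "TATGTAGAAG GAGGCAAGCC GGCATGA"

print(translate(HEAD))                             # MPNISDVARR
print(translate(TAIL, starts_at_initiator=False))  # YVEGGKPA*
```

The two outputs match the first ten and last eight residues of the listed protein sequence, with `*` marking the TGA stop codon.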