Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1262 |
Symbol | |
ID | 3833057 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1306526 |
End bp | 1307569 |
Gene Length | 1044 bp |
Protein Length | 347 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 637829198 |
Product | LacI family transcription regulator |
Protein accession | YP_430119 |
Protein GI | 83590110 |
COG category | [K] Transcription |
COG ID | [COG1609] Transcriptional regulators |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000000000350604 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 0 |
Fosmid unclonability p-value | 0.0000000118995 |
Fosmid Hitchhiker | No |
Fosmid clonability | unclonable |
| |
Sequence |
Gene sequence | ATGGAACCGT TACCAAGAGG AATGAAAAAT ATGCGCATAA CCATCAAAGA TATAGCCCAA AGAAGCGGGG TATCCTGTTC TACGGTTTCA CGTGTTTTAA CAAACCACCC CAACGTTGAT CCCAAAACCA GGGAACGGGT CAAGCAAGTT ATAGATGAAC TTGGTTACCG CCCCAGTCGC ATTGCCCGCG GTCTGGTTAT GGGCCAAATT AACGTAGTTG CCCTTATAAT TGGCGATATT CGTAACCCTT TCTATGCAGA ACTAACCCGT GCCGTAAAAG ATATCCTGAA TAAAGAAGGT TATATGGTTG TAGTTAGCGA TAGTGATTAT GACCCGCAAA AAGAGGAGAT ATATATTCGG GCAGCCGAGG AATATGGCTT TGCTGGCATT ATCATGATTA CAGCCATGGA AACCGAGGCT TTGATCCAAC AGTTAGAAAA ATTACGCTGC CCGGTTGTTT TGCTCAATCG CTACCTACCC TCCGTCGAGA CAGATGTTAT CTCCGTAGAT AATTATCTAG GCGGTTACCT GGCCGCTGAG CACCTCATCA AGCTAGGGCA TCGTAATATC GCTCACCTGG CTGGTTTTAA AAACTCCAGT GCTACCCGAG ATCGCCTTCG TGGCTTCATC GACGCCCATG TCCATTATGG TATCCAGTTA AATCAAGAAA GAATAGTTTA TGGTAATTTG CAGATGGAAG CAGGGTATAA ATTCGCTAAA GAATATTTAA GCCAGAACGA AGACATTACT GCAGTCTTTT GTGGAAACGA TCTCATGGCC CTGGGATTGA TAGAAGCCCT GTACGAGGAG GGGAAAGAAA TACCCCGAGA TATAAGTGTT ATAGGTTACG ATGATATTGA CATGGCTTCT CTGGCAAGGG TAAAACTCAC GACCATCCGC CAGCCCCAGT ATGAAATGGG CCAGACAGCT GCCGAGGTCT TAATTGACAG AATGAAAGGT AAAATAGGAG CACCAAAACG TATTATCTTT ACACCTAAGC TGATCATCCG TGAGAGTACC GCCGAGTATA AACCCGGTAA ATAA
|
Protein sequence | MEPLPRGMKN MRITIKDIAQ RSGVSCSTVS RVLTNHPNVD PKTRERVKQV IDELGYRPSR IARGLVMGQI NVVALIIGDI RNPFYAELTR AVKDILNKEG YMVVVSDSDY DPQKEEIYIR AAEEYGFAGI IMITAMETEA LIQQLEKLRC PVVLLNRYLP SVETDVISVD NYLGGYLAAE HLIKLGHRNI AHLAGFKNSS ATRDRLRGFI DAHVHYGIQL NQERIVYGNL QMEAGYKFAK EYLSQNEDIT AVFCGNDLMA LGLIEALYEE GKEIPRDISV IGYDDIDMAS LARVKLTTIR QPQYEMGQTA AEVLIDRMKG KIGAPKRIIF TPKLIIREST AEYKPGK
|
| |