Gene Information |
Locus tag | Moth_2366 |
Symbol | |
ID | 3832546 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 2488965 |
End bp | 2489924 |
Gene Length | 960 bp |
Protein Length | 319 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637830285 |
Product | cell envelope-related transcriptional attenuator |
Protein accession | YP_431191 |
Protein GI | 83591182 |
COG category | [K] Transcription |
COG ID | [COG1316] Transcriptional regulator |
TIGRFAM ID | [TIGR00350] cell envelope-related function transcriptional attenuator common domain |
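
The length fields above follow from the coordinates. A minimal sketch of the arithmetic, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon (both are assumptions, not stated in the record; the variable names are illustrative only):

```python
# Values copied from the Gene Information fields above.
start_bp = 2488965
end_bp = 2489924

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)  # 960 319
```

Under these assumptions the result agrees with the listed Gene Length (960 bp) and Protein Length (319 aa).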
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000000181907 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 0 |
Fosmid unclonability p-value | 0.00000001 |
Fosmid Hitchhiker | No |
Fosmid clonability | unclonable |
| |
Sequence |
Gene sequence | ATGGCCATGC ATGACCAGTA TCAAACCGGG ATATGGGATG GAGGTACGGC CCGGGCGCAA AAGCGGCGCA GTTCCCTGTC CTTGATTACC GGCTTATTAA TCCTGGCCGT TTTAACCTTC GCTGCCTTTG CCTACGCCGG CTACCGGGTA GCCGACCTGT GGCTGGCCGG GAACCTGGCT ACGGGGCAGG GCCAGCAGGA CGGGAGCGGC ATCGCCGAAG CGGCTCCAGG CAACCCGGAG ACCATTCTCC TCATCGGCGT CGACCAGCGG GTGCCTGGCG AACCCTCCCG GTCGGATACC ATTATGCTGG CCGTCCTGGA CCCGCAAAAA CCCGGGGTCG ACCTCCTTTC CATTCCCCGG GACACGCGAG TCAAGATCCC CGGCCACGGT TACGACAAGA TTAACGCCGC CCACGCCTAC GGCGGCCCCC AGCTATTGAT GGACACCATC AACGATTTTC TGGGCAGCCA CGTGGACAAG TACGTGGAGG TCAATTTCCA GGATTTCCAG AAGATCATCG ACATTCTGGG CGGGGTCGAT ATCGACGTCG ACAAGCGCAT GTACTACCCG GAAGAGAACA TAAACTTAAA GCCGGGGTTC CAGCACCTGA ACGGCTATGA CGCCCTGGCC TACGTCCGCT ACCGCTACGA CCCCGAGGGG GATATCACCA GGGTCGGGCG CCAGCAGAAG TTTATGAAAG CCCTCATCGA CCAGACCTTG AAACTGAGCA CCATACCCAA AATACCCAAA CTGGTCAGCG AAATCAGCAA GGACGTCAAG ACAAACCTCA GCGTCAAAGA AATGCTCTCC CTGGCCCTGA GCATGAAGGA TTTAAACGGT AGCGCCGTCA ATACCTACAC GATCCCCGGC GAAGGCAAAT ACGTGGGCGG CGTGAGTTAC TATATCGTCG ATCAGCAGAA GCTGACTGAT GTTCTAGCGT CCATTCCCAA CCTGCGCTAG
|
Protein sequence | MAMHDQYQTG IWDGGTARAQ KRRSSLSLIT GLLILAVLTF AAFAYAGYRV ADLWLAGNLA TGQGQQDGSG IAEAAPGNPE TILLIGVDQR VPGEPSRSDT IMLAVLDPQK PGVDLLSIPR DTRVKIPGHG YDKINAAHAY GGPQLLMDTI NDFLGSHVDK YVEVNFQDFQ KIIDILGGVD IDVDKRMYYP EENINLKPGF QHLNGYDALA YVRYRYDPEG DITRVGRQQK FMKALIDQTL KLSTIPKIPK LVSEISKDVK TNLSVKEMLS LALSMKDLNG SAVNTYTIPG EGKYVGGVSY YIVDQQKLTD VLASIPNLR
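
The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that formatting and run two simple checks: the GC fraction of a sequence, and whether it ends in a stop codon under translation table 11. The helper names (`clean`, `gc_fraction`, `ends_with_stop`) are hypothetical and not part of any database tooling. Only the first and last ten-base blocks of the gene sequence are used as inputs here; to check the listed GC content, pass the full Gene sequence string instead.

```python
# Stop codons under NCBI translation table 11 (bacterial, archaeal, plant plastid).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean(seq: str) -> str:
    """Remove the block-of-ten spacing and normalize to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in s) / len(s)


def ends_with_stop(seq: str) -> bool:
    """Return True if the last three bases form a table 11 stop codon."""
    return clean(seq)[-3:] in STOP_CODONS


# First and last blocks of the Gene sequence field above.
first_block = "ATGGCCATGC"
last_block = "CCTGCGCTAG"

print(gc_fraction(first_block))       # 0.6
print(clean(first_block)[:3])         # ATG
print(ends_with_stop(last_block))     # True
```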