Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1047 |
Symbol | |
ID | 3831853 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 1078015 |
End bp | 1079103 |
Gene Length | 1089 bp |
Protein Length | 362 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637828975 |
Product | NusA antitermination factor |
Protein accession | YP_429904 |
Protein GI | 83589895 |
COG category | [K] Transcription |
COG ID | [COG0195] Transcription elongation factor |
TIGRFAM ID | [TIGR01953] transcription termination factor NusA |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000000000154988 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 0 |
Fosmid unclonability p-value | 0.0000000137146 |
Fosmid Hitchhiker | No |
Fosmid clonability | unclonable |
| |
Sequence |
Gene sequence | ATGAACAGTG AGTTTATCCA GGCACTACGG GACCTGGAGA GGGAAAAGGG TATTAACGCC GATATTTTAC TGGAGGCCAT TGAAGCGGCT TTAATCTCGG CTTACAAGAA GAACTTTGGC TCCCTGCAGA ATGTCAGGGT GGACATCCAG CGTGATACCG GGGAGATTAA GGTTCTGGCC CAGCGCCAGG TGGTAGAAGA GGTAACCGAT CCCCGGCAGG AGATCTCCCT CGAGGAAGCT CGGGCCATCA ACAGTAAATA TGAACTGGGG GACATAGTGG AGAAAGAGGT TACTCCCAGG GATTTTGGCC GCATCGCTGC CCAAACGGCC AAACAGGTGG TCGTCCAGCG CATCCGGGAA GCCGAGCGCG GCTTGATTTA TGAAGAATTT ATCGGCCGGG AAAATGACCT CGTTACGGGT GTAGTCCAGC GCCAGGAGGG CAAAAACATT ATCCTTGACC TGGGCCGGGC CGAGGCGATC CTGCTTCCCA GCGAACAGAG CCCCGGAGAG ACCTACCGCC AGGGCGAACG CCTGAAGGTC TATGTCCTGG AGGTCAGGAA GACTAACAAA GGGCCCCAGA TTCTCGTGTC CCGGACCCAT CCCGGCTTGA TAAAGAGGCT TTTCGAGCTG GAAGTTCCGG AAATCCACGA TGGCATTGTT GAAATTAAGG GAGTCGCCAG GGAACCTGGG GCGCGCTCCA AGATTGCCGT TCATTCCCGG GATGAAAAGG TGGATCCGGT GGGCTCCTGC GTAGGTCCCA AGGGGGCACG GGTACAGGCT GTGGTCCAGG AGCTGCGGGG CGAGAAGGTA GATATCATTA AATGGAGCGA TGACCCGGCT GTTTATGTGG CCAACTCCTT GAGCCCGGCC CGGGTCCTGG ACGTGACTGT CGACGAAGAA AATAAGGTGA GCCAGGTCAT CGTTCCTGAT AACCAGCTCT CCCTGGCCAT TGGTAAGGAA GGCCAGAATG CCCGCCTGGC AGCCAGGATC ACCGGCTGGA AAATCGATAT TAAACCGGAA TCCGAAGCTG GCGATTGGGA TTCCTGGGAT GCCGACCTGG ATCTTGACGG CACGATAGAG GAGGAGTAA
|
Protein sequence | MNSEFIQALR DLEREKGINA DILLEAIEAA LISAYKKNFG SLQNVRVDIQ RDTGEIKVLA QRQVVEEVTD PRQEISLEEA RAINSKYELG DIVEKEVTPR DFGRIAAQTA KQVVVQRIRE AERGLIYEEF IGRENDLVTG VVQRQEGKNI ILDLGRAEAI LLPSEQSPGE TYRQGERLKV YVLEVRKTNK GPQILVSRTH PGLIKRLFEL EVPEIHDGIV EIKGVAREPG ARSKIAVHSR DEKVDPVGSC VGPKGARVQA VVQELRGEKV DIIKWSDDPA VYVANSLSPA RVLDVTVDEE NKVSQVIVPD NQLSLAIGKE GQNARLAARI TGWKIDIKPE SEAGDWDSWD ADLDLDGTIE EE
|
| |