Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_1691 |
Symbol | nusA |
ID | 7084111 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 1897156 |
End bp | 1898631 |
Gene Length | 1476 bp |
Protein Length | 491 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643698712 |
Product | transcription elongation factor NusA |
Protein accession | YP_002355342 |
Protein GI | 217970108 |
COG category | [K] Transcription |
COG ID | [COG0195] Transcription elongation factor |
TIGRFAM ID | [TIGR01953] transcription termination factor NusA [TIGR01954] transcription termination factor NusA, C-terminal duplication |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.349056 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCGCG AGATCCTGCT GCTTGTCGAT GCCTTGGCGC GCGAGAAGAA CGTCGCCAAG GACATCGTTT TTTCCGCGCT CGAAACCGCC TTGGCCTCTG CGACCAAGAA ACGCATCCAC GACGATGCCG ACGTGGTGGT GTCGATCGAC CGCGACTCCG GCGACTACAC CTCGAAGCGC CGCTGGCTGG TGATGCTCGA CGAGGAGGTC GCGAACGACG AGGCCGAGAT GGGCATCATC GATGCCCGCG AGCTGCGCGC CGACGTGCAG ATCGGTGACT ACATCGAGGA AGAGCTCGAG CCGATCGACT TCGGTCGCAT CGGCGCCCAG GCCGCCAAGC AGGTCATCCT GCAGAAGATC CGCGACGCCG AGCGCGAGCA GGTGCTCAAC GACTTCCTCG ACCGCAAGGA GTTCCTCGTC TCCGGCTCGA TCAAGCGCAT GGAGCGCGGC AACGCGATCA TCGAGGTCGG CCGCATGGAA GCCGTGCTGC CGCGCGACCA GCAGATCCCG CGCGAGAATC TGCGCGTGGG CGATCGGGTC AAGGCCTTCC TGCTGCGCAT CGACCGCGGC GCGCGCGGCC CGCAGCTGGT GCTGTCGCGC ACCGCGCCCG AATTCCTCAT GAAGCTCTTC GAGCTCGAGG TCCCCGAGAT CGAGGACGGC CTGCTCGAGC TCAAGGCCTG CGCCCGCGAC GCCGGCCTGC GCGCCAAGAT CGCGGTCAAG TCCAACGACC AACGCATCGA CCCGATCGGT ACCTGCGTCG GCCTGCGCGG CTCGCGCGTC ACCGCCGTGC GCAACGAGAT CGCCGGCGAG CAGATCGACA TCATCGTGTG GTCGCAGGAT CCCGCCCAGT TCGTGGTCGC CGCGCTGCAG CCCGCCGAGG TCGTCTCCAT CGTCGTGGAC GAGGAGTCGC ACGCGATGGA CGTGGTGGTC GACGAGAACA ACCTCGCGAT CGCCATCGGC CGCAGCGGCC AGAACGTCAA GCTCGCCTCC GAGCTCACCG GGTGGACGAT CAACCTGATG AGCGAGCAGG AGTCGGCCGA AAAGACCGCC CAGGAGCAGC AGGGCCTGCG CGCGCTGTTC ATGGAAAAAC TGGACGTCGA CGAGGAAGTC GCCGACATCC TGATCGAGGA GGGTTTCTCC TCGCTCGAAG AGGTGGCCTA CGTGCCGCTC TCCGAAATGC TCGAGATCGA GGCCTTCGAC GAGGACACGG TCAACGAACT GCGCAATCGA GCGCGCAATG TGCTGCTGAC CGAGGCCATC GTCACCGAGG AGCAGCTCGA GAAGGTTTCC GACGACTTGC TCGGCCTTGA AGGCATGGAC AAGTCGCTGG CCGCCACACT GGCCCAGCAG GGCATTCGTA CCCGCGACGA CCTGGCCGAC CTTGCGGTCG ACGAGCTGGT CGAAATGGCC GGGATCGACG AAGAAAGAGC CAAGGCGCTG ATTTCCGTTG CGCGCGCCCA TTGGTTCGAA GAATGA
|
Protein sequence | MSREILLLVD ALAREKNVAK DIVFSALETA LASATKKRIH DDADVVVSID RDSGDYTSKR RWLVMLDEEV ANDEAEMGII DARELRADVQ IGDYIEEELE PIDFGRIGAQ AAKQVILQKI RDAEREQVLN DFLDRKEFLV SGSIKRMERG NAIIEVGRME AVLPRDQQIP RENLRVGDRV KAFLLRIDRG ARGPQLVLSR TAPEFLMKLF ELEVPEIEDG LLELKACARD AGLRAKIAVK SNDQRIDPIG TCVGLRGSRV TAVRNEIAGE QIDIIVWSQD PAQFVVAALQ PAEVVSIVVD EESHAMDVVV DENNLAIAIG RSGQNVKLAS ELTGWTINLM SEQESAEKTA QEQQGLRALF MEKLDVDEEV ADILIEEGFS SLEEVAYVPL SEMLEIEAFD EDTVNELRNR ARNVLLTEAI VTEEQLEKVS DDLLGLEGMD KSLAATLAQQ GIRTRDDLAD LAVDELVEMA GIDEERAKAL ISVARAHWFE E
|
| |