Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A1865 |
Symbol | nusA |
ID | 3786515 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | - |
Start bp | 2151210 |
End bp | 2152697 |
Gene Length | 1488 bp |
Protein Length | 495 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637811951 |
Product | transcription elongation factor NusA |
Protein accession | YP_412552 |
Protein GI | 82702986 |
COG category | [K] Transcription |
COG ID | [COG0195] Transcription elongation factor |
TIGRFAM ID | [TIGR01953] transcription termination factor NusA [TIGR01954] transcription termination factor NusA, C-terminal duplication |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.253606 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCGCG AGGTTTTGCT ATTGGTGGAT GCACTGGCGC GCGAAAAGAA TGTGGAAAAG AACATTGTTT TTGCCGCACT TGAGCTTGCA TTGGCGTCCG CCACGAAAAA GCGTTTTAAC GAGGACGCTG ACGTGCGCGT GTCGATCGAC CACCAGACAG GGGACTATCA ATCCTTGCGC CGCTGGCAGG TGGTGGCCGA CGATGCAGTG GAGGATCCCG CTCGTCAGAT ATCGCTGAGC GAGGCCTTTC GGCAAAATCC TGAAATCCAG CTCGACGAAT ATATCGAAGA AATCCTGGAG CCGGTGGAAT TCGGGCGCAT CGGCGCCCAA GCGGCGAAAC AGGTGATATT CCAGAAAATC CGTGATGCCG AACGCGAGCA GATCCTGAAC GATTTTCTCG AGCGCAAGGA ATATATGGTT ACCGGAACCA TCAAGCGCAT GGAACGCGGA AACGCGATCA TCGAATCCGG CAGGGTAGAG GCCGTATTGC CGCGTGACCA GATGATTCCC AAGGAGAATC TGCGTGTCGG GGACAGAGTG CGCGCTTACT TGCACAGGAT CGACCGTACT ACGAGAGGTC CGCAGTTAAT CCTGTCGAGA ATTGTGCCGG AATTTCTGAT AAAGCTGTTT GAGCTGGAAG TGCCTGAGAT CGAGGAGGGC TTGCTGGAGA TCAAGGTGGC AGCGCGTGAT CCTGGTTCGC GTGCGAAGAT CGCAGTGAAG TCGAATGATC AGCGAGTGGA TCCGATCGGT ACCTGCGTAG GCATGCGGGG TTCCAGAGTC CAGGCCGTGA CCGGCGAACT GGCGGGTGAA AGGGTAGACA TCATCCTTTG GTCTGACGAC CCCGCCACAT TCGTAATCAA TGCGCTGGCT CCTGCCGAGA TCAGCAGCAT TCTGGTGGAT GAAGAGAAAC ACAGCATGGA CATTGTCGTG GATGAAGGCA ATCTGGCCCA GGCGATCGGA CGTGGAGGCC AGAATGTCCG GCTCGCCTCG GAATTGACAG GCTGGGAGCT CAACATCATG ACGATGGAAG AGTCACAGGC AAAAAATGAA GAGGAGTTTT CGGTTTTGCG CCATCTCTTC ATGGAAAAAC TGGATGTGGA TGAGGAAGTG GCGGACATTC TGGCGCAGGA AGGCTTTACT ACTCTGGAAG AAGTGGCTTA CGTGCCGCTC AGCGAAATGA TGGAGATCGA AGCATTTGAC GAACAGACCG TCAACGAGCT TCGCAATCGG GCTCGCAACG CATTGTTGAC CGAGGCCATC GTGAGCGAGG AAAAGGTGGA GCACATCGCC GAAGACCTGC TCGCGCTGGA AGGCATGGAC AGCCAGACCG CGCGCGAACT CGCGGCTAAA GGAGTAAGCA CCCAGGAAGA CCTGGCCGAC CTGGCTGTGG ATGATCTGGT GGAAATGGTC GAGATGGATA CTGAACGGGC CAGGCAGTTG ATCATGGCGG CACGTGCCCC GTGGTTTGCC CCGGCCGATA GAGCATAG
|
Protein sequence | MSREVLLLVD ALAREKNVEK NIVFAALELA LASATKKRFN EDADVRVSID HQTGDYQSLR RWQVVADDAV EDPARQISLS EAFRQNPEIQ LDEYIEEILE PVEFGRIGAQ AAKQVIFQKI RDAEREQILN DFLERKEYMV TGTIKRMERG NAIIESGRVE AVLPRDQMIP KENLRVGDRV RAYLHRIDRT TRGPQLILSR IVPEFLIKLF ELEVPEIEEG LLEIKVAARD PGSRAKIAVK SNDQRVDPIG TCVGMRGSRV QAVTGELAGE RVDIILWSDD PATFVINALA PAEISSILVD EEKHSMDIVV DEGNLAQAIG RGGQNVRLAS ELTGWELNIM TMEESQAKNE EEFSVLRHLF MEKLDVDEEV ADILAQEGFT TLEEVAYVPL SEMMEIEAFD EQTVNELRNR ARNALLTEAI VSEEKVEHIA EDLLALEGMD SQTARELAAK GVSTQEDLAD LAVDDLVEMV EMDTERARQL IMAARAPWFA PADRA
|
| |