Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Namu_2204 |
Symbol | nusA |
ID | 8447815 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | + |
Start bp | 2433088 |
End bp | 2434098 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 645041326 |
Product | transcription elongation factor NusA |
Protein accession | YP_003201570 |
Protein GI | 258652414 |
COG category | [K] Transcription |
COG ID | [COG0195] Transcription elongation factor |
TIGRFAM ID | [TIGR01953] transcription termination factor NusA |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00000000400476 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.000554149 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAACGTCG ACATTGCCGC ACTGCGGGCG GTCGAGAAGG AAAAGGGCAT CGACTTCAAC TCGTTGATCG ACACCCTGGA GACGGCGCTG CTGACCGCCT ACCGGCACAC CCAGGGACAC GCCGGGCACG CCCGGGTGCA GATCGACCGC AAGACCGGCG GGATCCGGGT CTGGGCCCAG GAGACCGACG ACTCCGGCGA GATCGTGCGC GAATGGGACG ACACCCCGGA GGACTTCGGC CGGATCGCCG CGACCACCGC CCGCCAGGTC ATCCTGCAGC GGCTGCGGGA CGTGGACAAC GAACGCACCT TCGGCGACTT CGCCGGCCGC GAGCACGACC TGATCACCGG CACCATCTCG GCCGACGCGA AGGTCAACGC GCGCGGTGTG GTCGTGGTCA GCCTGGGCGA GGGCAAGAAC GCGGTGGAGG GCATCATCCC GGCCGCCGAA CAGGTGCCGG GGGAGTCCTA CCCGCATGGC CAGCGGCTGC GCTGCTACGT CGTCTCGGTC GCCCGCGGGC TGCGCGGGCC GCAGATCACC CTGTCCCGCA CGCACCCGAA TCTGGTCCGC AAGTTGTTCG CCCTCGAGGT GCCCGAGATC GCCGACGGCT CGGTGGAGAT CACTGCGGTG GCCCGCGAGG CCGGTCACCG GTCCAAGATT GCGGTGCGGC CGGCGGTGGC CGGGGTCAAC GCCAAGGGTG CCTGTATCGG CCCGATGGGC GCGCGGGTCC GTGCTGTGAT GAGCGAACTC AACGGCGAGA AGATCGACAT CATCGACTAC GACGAGGACC CGGCCACGTT CGTGGGCAAC GCGCTCTCGC CGGCGCGGGC GCTGTCCACC ACGGTGATCG ACCCGGTGGC CAAGGCCGCC CGGGTGGTGG TCCCGGACTT CCAGTTGTCC CTGGCCATTG GCAAGGAGGG GCAGAACGCC CGGCTCGCCG CCCGGCTCAC CGGTTGGCGG ATCGACATCC GCAGCGACGC CGCCCCGGAC CCGGCCGGCG AGGGTCGATG A
|
Protein sequence | MNVDIAALRA VEKEKGIDFN SLIDTLETAL LTAYRHTQGH AGHARVQIDR KTGGIRVWAQ ETDDSGEIVR EWDDTPEDFG RIAATTARQV ILQRLRDVDN ERTFGDFAGR EHDLITGTIS ADAKVNARGV VVVSLGEGKN AVEGIIPAAE QVPGESYPHG QRLRCYVVSV ARGLRGPQIT LSRTHPNLVR KLFALEVPEI ADGSVEITAV AREAGHRSKI AVRPAVAGVN AKGACIGPMG ARVRAVMSEL NGEKIDIIDY DEDPATFVGN ALSPARALST TVIDPVAKAA RVVVPDFQLS LAIGKEGQNA RLAARLTGWR IDIRSDAAPD PAGEGR
|
| |