Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Svir_14550 |
Symbol | nusA |
ID | 8386788 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Saccharomonospora viridis DSM 43017 |
Kingdom | Bacteria |
Replicon accession | NC_013159 |
Strand | + |
Start bp | 1502015 |
End bp | 1503061 |
Gene Length | 1047 bp |
Protein Length | 348 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 644975534 |
Product | transcription elongation factor NusA |
Protein accession | YP_003133322 |
Protein GI | 257055490 |
COG category | [K] Transcription |
COG ID | [COG0195] Transcription elongation factor |
TIGRFAM ID | [TIGR01952] NusA family KH domain protein, archaeal [TIGR01953] transcription termination factor NusA |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.0106294 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAACGTCG ACATCGCGGC GTTGCGCGCG ATCGAAGCGG ACAAGGACAT CCCCTTCGAG ACGGTGCTGG AAGCCATCGA GAGCGCGTTG CTCACGGCGT ATAAGCACAC CGAGGGCCGC CAGCCGCACG CCAGGATCGA CATCGACCGG AAGACCGGGT ACGTGCGCGT GATCGCGTAC ACGCTCGACG AGAACGGCGA GGTCGTCGAG GAGTGGGACG ACACCCCCGA GGGGTTCGGC CGGATCGCTG CCGCCACCGC TCGACAGGTC ATCCTGCAGC GTCTGCGGGA TGCGGAGCAC GAGAAGACCT ACGGCGAGTT CTCGGCACAG GAAGGCGAGA TCGTGGCCGG CGTCGTGCAG CGCGATGCCA AGGCCAACGC CAGGGGCATG GTGATCGTCC AGGTCGGCGA CACCGAGGGT GTGCTGCCTC CCGCGGAGCA GGTGCCGGGC GAGGTGTACG AGCACGGTGC GCGGCTCAAG GCGTACGTCG TGGGGGTGTC CCGCACCGCC CGTGGGCCGC AGATCACGCT GTCGCGCACC CATCCGAACC TGGTGCGGAA GCTGTTCGCT CTTGAGGTCC CCGAGATCGC CGACGGCACT GTCGAGATCA CGGCCGTGGC CAGGGAGGCG GGCCACCGGT CGAAGATCGC CGTTCGGTCC ACCGTGCCCG GCGTCAACGC CAAGGGCGCT TGCATCGGGC ACGTGGGCGC TCGAGTGCGC AACGTCATGA GCGAGCTCGG TGGCGAGAAG ATCGATATCA TCGATCACTC GGACGACCCG GCGCGTTTCG TCGGGAATGC TCTGTCGCCC GCGAAGGTTG TATCTGTCGA CGTGGTCGAC GAGCGGACCA AGACGGCCCG GGTCATCGTG CCGGACTTCC AGTTGTCTCT GGCAATCGGT AAGGAAGGTC AGAATGCGCG GCTCGCGGCC CGACTCACCG GGTGGCGGAT CGACATTCGC AGTGATGCGG CCCCCGAACC CGGGGAGCAG CAGCACATGG ACCCGGCCGA GAAGCGGCAT CCGGCAGCGA CCGGTTCCGC TGACTGA
|
Protein sequence | MNVDIAALRA IEADKDIPFE TVLEAIESAL LTAYKHTEGR QPHARIDIDR KTGYVRVIAY TLDENGEVVE EWDDTPEGFG RIAAATARQV ILQRLRDAEH EKTYGEFSAQ EGEIVAGVVQ RDAKANARGM VIVQVGDTEG VLPPAEQVPG EVYEHGARLK AYVVGVSRTA RGPQITLSRT HPNLVRKLFA LEVPEIADGT VEITAVAREA GHRSKIAVRS TVPGVNAKGA CIGHVGARVR NVMSELGGEK IDIIDHSDDP ARFVGNALSP AKVVSVDVVD ERTKTARVIV PDFQLSLAIG KEGQNARLAA RLTGWRIDIR SDAAPEPGEQ QHMDPAEKRH PAATGSAD
|
| |