Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mvan_2312 |
Symbol | nusA |
ID | 4644347 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium vanbaalenii PYR-1 |
Kingdom | Bacteria |
Replicon accession | NC_008726 |
Strand | + |
Start bp | 2469095 |
End bp | 2470096 |
Gene Length | 1002 bp |
Protein Length | 333 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 639805796 |
Product | transcription elongation factor NusA |
Protein accession | YP_953132 |
Protein GI | 120403303 |
COG category | [K] Transcription |
COG ID | [COG0195] Transcription elongation factor |
TIGRFAM ID | [TIGR01952] NusA family KH domain protein, archaeal [TIGR01953] transcription termination factor NusA |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.0988305 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.0487624 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGGCTC TGCACGCGAT CGAGGCCGAT AAGGGTATCT CGGTCGACGT GGTGGTCGAC ACCATCAAAT CGGCACTGCT GACGGCCTAC CGGCATACCG AGGGCCACGA GGCCGACGCG CACATCGACA TCGACCGCAA GACCGGCGCC GTCAAGGTGA TCGCCCGCCA GACCGACGAA GACGGCAACG TGCTGCACGA ATGGGACGAC ACCCCAGAGG GTTTCGGCCG AATCGCGGCG ACCACCGCGC GTCAGGTCAT CCTGCAGAGG CTCCGCGACG CCGAGAACGA GAAGAACTAC GGCGAGTTCT CGGCGCGCGA GGGCGACATC GTCGCCGGCG TCATCCAGCG TGATGCGCGC GCCAACGCCC GAGGGCTCGT GGTGGTCCGG ATGGGCAGCG AGACCAAGGG TTCCGAAGGT GTGATCCCGG CCGCCGAGCA GGTGCCCGGA GAGCGGTACG AGCACGGCGA CCGGTTGCGG TGCTACGTCG TCGGCGTGAC GCGCGGCGCC AGGGAGCCGC TCATCACCCT GTCGCGAACG CATCCGAATC TGGTGCGCAA GCTGTTCTCC CTGGAGGTTC CCGAGATCGC CGACGGCTCG GTGGAGATCG TCGCGGTGGC CAGGGAGGCC GGCCACCGCT CCAAGATCGC GGTGGCGACC AGGGCGCCAG GGCTGAACGC CAAGGGCGCC TGCATCGGCC CGATGGGGCA GCGGGTGCGC AACGTGATGA GTGAGCTGTC CGGCGAGAAG ATCGACATCA TCGACTACGA CGAGGACCCG GCGCGGTTCG TGGCCAACGC GCTGTCGCCG GCCAAGGTGG TGTCGGTGAC GGTGATCGAC GAGGCCGCCC GTGCGGCGCG CGTCATCGTG CCGGACTTCC AGCTCTCTCT CGCGATCGGC AAGGAGGGCC AGAACGCACG TCTGGCGGCG CGCCTGACCG GATGGCGCAT CGACATCCGC AGCGACGACG CCGCCAAAGA AGGCGCGGTC GAGGAGCGTT GA
|
Protein sequence | MAALHAIEAD KGISVDVVVD TIKSALLTAY RHTEGHEADA HIDIDRKTGA VKVIARQTDE DGNVLHEWDD TPEGFGRIAA TTARQVILQR LRDAENEKNY GEFSAREGDI VAGVIQRDAR ANARGLVVVR MGSETKGSEG VIPAAEQVPG ERYEHGDRLR CYVVGVTRGA REPLITLSRT HPNLVRKLFS LEVPEIADGS VEIVAVAREA GHRSKIAVAT RAPGLNAKGA CIGPMGQRVR NVMSELSGEK IDIIDYDEDP ARFVANALSP AKVVSVTVID EAARAARVIV PDFQLSLAIG KEGQNARLAA RLTGWRIDIR SDDAAKEGAV EER
|
| |