Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rru_A3779 |
Symbol | nusA |
ID | 3837236 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodospirillum rubrum ATCC 11170 |
Kingdom | Bacteria |
Replicon accession | NC_007643 |
Strand | + |
Start bp | 4327448 |
End bp | 4328998 |
Gene Length | 1551 bp |
Protein Length | 516 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637827904 |
Product | transcription elongation factor NusA |
Protein accession | YP_428860 |
Protein GI | 83595108 |
COG category | [K] Transcription |
COG ID | [COG0195] Transcription elongation factor |
TIGRFAM ID | [TIGR01953] transcription termination factor NusA [TIGR01954] transcription termination factor NusA, C-terminal duplication |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.530973 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAACGAA CCGTTGGCAT GAGCCGGCCC GAACTAGTCC AGGTGGCCGA TACCGTGGCC CGCGACAAGA GCATCGAGCG CGAGGAAGTG TTCGCCGCCA TGGAGCAGGC CATTCAGAAG GCCGGTCGCT CCAAATACGG CCATGAACAC GATATCCGGG CGCGCATCGA TCGCAAGACC GGCGAAATCC GTCTGGCCCG TTACATCGAA GTGGTCGAGG AGGTGGAGAA CGAATTCACC CAGATCACCC TGGCCGGCGC CAAGCGCAAG AAAGCCGATA TCGAACTGGG TGAATTCCTG GTCGATCCCC TGCCCCCCAT CGACTTCGGG CGGATCGCCG CCCAGACCGC CAAGCAGGTG ATCGTTCAGA AGGTCCGCGA CGCCGAGCGC GAGCGCCAGT TCGCCGAATA CAAGGATCGC CTGGGCGAGA TCATCAATGG TCTGGTCAAG CGCGTCGAAT TCGGCAATGT CATCGTCGAC ATGGGCCGGG CCGAAGCGCT GCTGCGCCGG GACGAGGTCA TCCCGCGCGA GCACTTCAAG AACGGCGACC GCATCCGCGC CTATATCATG GACGTTCGCC AGGAACCCCG CGGCCCGCAG ATCTTCCTGT CGCGGACCCA CGAGCAGTTC ATGGCCAAGC TGTTCGCCCA GGAAGTGCCC GAAATCTACG ACGGCATCAT CGAGATCAAA TCGGTGGCGC GCGATCCCGG ATCGCGCGCC AAGATCGCCG TGCTGTCCAA CGACCACGCC ATCGATCCGG TCGGCGCCTG CGTCGGCATG CGCGGCAGCC GCGTTCAGGC GGTGGTCGCC GAGCTTCAGG GCGAGAAGAT CGACATCATT CAATGGGCGA CCGATCCGGC GACCTTCGTC GTCAACGCCC TGGCCCCGGC CGAGGTCGCC AAGGTCGTGC TTGACGAGGA AGCCAACCGC ATCGAAGTGG TGGTGCCCGA CGAGCAATTG TCGCTGGCCA TCGGCCGGCG CGGGCAGAAC GTCCGTCTGG CCTCGAAGCT GACCGGCTGG GATATCGATA TCCTGACCGA GGCCGAGGAA AGCGAACGCC GCCAGGACGA ATTCCGCATC CGCACCCAGG CCTTCATCGA AGCCCTGGAC GTTGACGACG TCATCGCCCA CCTTCTGGTG ACCGAAGGCT TCACCACCGT CGAGGACGTG GCCTTCGTGG CCATTGCCGA GTTGAGCGAG ATCGAAGGCT TCGACGAGGA CGTGGCCTCC GAGCTGCAGA ACCGCGCCCA GGTCTTCCTG GCCGAACGCG AGCGTTTGAA CGAAGAGCGC CGCAAGGAAC TGGGCGTCGT CGACGAGATG GCCGAAATCG AGGGGCTGAC CGCCGCCATG CTGGTCGCCC TGGGCGAAAA GGGCGTGAAA ACCCTCGACG ATCTCGGCGA TCTGGCCAGC GACGAATTGA TCGAGATCGT TGGCGGCCTG AATGAGGACG ACGCCAACGC CGTGATCATG GCGGCGCGCG CCCATTGGTT CGGCGAAGCC GGCGACTCGG CCGACGACGC GACGGCGAAG ACCCCGGAGC AGGATGCCTG A
|
Protein sequence | MERTVGMSRP ELVQVADTVA RDKSIEREEV FAAMEQAIQK AGRSKYGHEH DIRARIDRKT GEIRLARYIE VVEEVENEFT QITLAGAKRK KADIELGEFL VDPLPPIDFG RIAAQTAKQV IVQKVRDAER ERQFAEYKDR LGEIINGLVK RVEFGNVIVD MGRAEALLRR DEVIPREHFK NGDRIRAYIM DVRQEPRGPQ IFLSRTHEQF MAKLFAQEVP EIYDGIIEIK SVARDPGSRA KIAVLSNDHA IDPVGACVGM RGSRVQAVVA ELQGEKIDII QWATDPATFV VNALAPAEVA KVVLDEEANR IEVVVPDEQL SLAIGRRGQN VRLASKLTGW DIDILTEAEE SERRQDEFRI RTQAFIEALD VDDVIAHLLV TEGFTTVEDV AFVAIAELSE IEGFDEDVAS ELQNRAQVFL AERERLNEER RKELGVVDEM AEIEGLTAAM LVALGEKGVK TLDDLGDLAS DELIEIVGGL NEDDANAVIM AARAHWFGEA GDSADDATAK TPEQDA
|
| |