Gene Information |
Locus tag | RPB_0598 |
Symbol | nusA |
ID | 3908291 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 670952 |
End bp | 672571 |
Gene Length | 1620 bp |
Protein Length | 539 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637882487 |
Product | transcription elongation factor NusA |
Protein accession | YP_484220 |
Protein GI | 86747724 |
COG category | [K] Transcription |
COG ID | [COG0195] Transcription elongation factor |
TIGRFAM ID | [TIGR01953] transcription termination factor NusA; [TIGR01954] transcription termination factor NusA, C-terminal duplication |
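The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the terminal stop codon (the gene sequence in this record ends in TGA). The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon occupies one codon but encodes no residue.
    return codons - 1 if includes_stop else codons


# Values copied from the record above.
length = gene_length_bp(670952, 672571)
print(length)                     # 1620
print(protein_length_aa(length))  # 539
```

Both results agree with the Gene Length (1620 bp) and Protein Length (539 aa) fields.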
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGCCGTCA GCGCCAACAA GCTGGAATTG CTGCAGATCG CCGACGCGGT GGCGCGGGAG AAATCGATCG ACCGCGGCAT CGTGATCGCC GCGATGGAAG ACGCGATCGC GAAGGCGGCG CGCGCCCGCT ACGGCTCGGA GACCGACGTC CACGCCGAGA TCGACGCCAA GAAGGGCGAA TTGCGGCTGT CGCGCCATAT GCTGGTGGTC GAAAACGTCG AGAACCCCGC CAACCAGATC TCGCTGAAGG CCGCGCAGCG CGCCAACCCC GGCGCGCAGA TCGGCGACAC CATCGCCGAC ACGCTGCCGC CGCTGGAATA CGGCCGCATC GCCGCGCAGT CGGCCAAGCA GGTGATCGTG CAGAAGGTGC GCGAGGCCGA GCGTGACCGG CAATACTCGG AATTCAAGGA TCGCATCGGC GACATCGTCA ACGGCGTCGT CAAGCGCGTC GAATACGGCA GCGTGATCGT CGATCTCGGC CGCGGCGAGG CGATCGTGCG CCGCGACGAG ATGCTGCCGC GCGAATCGTT CCGCAACGGC GACCGCGTCC GCGCCTACAT CTTCGACGTC CGCCGCGAGA CCCGCGGCCC GCAGATCTTC CTGTCGCGCA CTCATCCGCA GTTCATGGCC AAGCTGTTCG CCCAGGAAGT GCCGGAAATC TACGACGGCA TCGTCGAGAT CAAGGCGGTC GCCCGCGATC CGGGCTCGCG CGCCAAGATC GGCGTGGTGT CGCGGGATTC CTCGGTCGAT CCGGTCGGCG CCTGCGTCGG CATGCGCGGC TCGCGCGTTC AGGCCGTGGT CAACGAGCTG CAGGGCGAGA AGATCGACAT CATCCCGTGG TCGCCGGACA TCGCCACTTT CGTGGTCAAC GCGCTGGCGC CGGCCGAAGT CTCGAAGGTC GTGATCGACG AAGATCGCGA GCGCATCGAG GTTGTGGTTC CCGACACCAA TAACCAATTA TCCCTTGCGA TCGGCCGTCG CGGCCAGAAC GTTCGTTTGG CCTCGCAGCT CACCGGCTGG GACATCGACA TCCTGACCGA GACCGAGGAA TCCGAGCGCC GCCAGGCCGA TTTCGAGAAT TCGACCCGGG TGTTCATGGA AGCGCTGAAC GTCGACGAAG TGGTCGGCCA GCTGCTCGCC TCCGAGGGGT TCACCTCGGT CGAGGAACTG GCGCTGGTCG ATATCCGCGA ACTGGCGTCG ATCGAGGGTT TCGACGAGGA AACCGCGACC GAGCTGCAGG CCCGCGCCAG CGAATATCTC GATCGGGTGG AGACGGAAAT GGAGGCGCGG CGCCTGGAAC TCGGCGTCGA GGACGCCCTC AAGGACGTCC CCGGCATCAC CTCGAAGATT CTGGTCAAGC TCGGCGAGGG CGACGTCAAG ACGGTCGAGG ATCTGGCCGG CTGCGCCACC GACGATCTGG TCGGCTGGAC CGAGCGCAAG GAAGGCGCCG AGCCGGTGAA GTTCGCCGGC ATTCTCGACG GCGTCGAGGG CGTCACGCGC GACGAGGCCG AAGACCTGAT CATGCAGGCC CGCGTCAAGG CCGGCTGGAT CACCGAGGAG GAACTCGCCA GCAGCAAGGG CGAGGCCGCC ATTGCCGAGA CCGAAGCCGA GGCGGAGTGA
Protein sequence | MAVSANKLEL LQIADAVARE KSIDRGIVIA AMEDAIAKAA RARYGSETDV HAEIDAKKGE LRLSRHMLVV ENVENPANQI SLKAAQRANP GAQIGDTIAD TLPPLEYGRI AAQSAKQVIV QKVREAERDR QYSEFKDRIG DIVNGVVKRV EYGSVIVDLG RGEAIVRRDE MLPRESFRNG DRVRAYIFDV RRETRGPQIF LSRTHPQFMA KLFAQEVPEI YDGIVEIKAV ARDPGSRAKI GVVSRDSSVD PVGACVGMRG SRVQAVVNEL QGEKIDIIPW SPDIATFVVN ALAPAEVSKV VIDEDRERIE VVVPDTNNQL SLAIGRRGQN VRLASQLTGW DIDILTETEE SERRQADFEN STRVFMEALN VDEVVGQLLA SEGFTSVEEL ALVDIRELAS IEGFDEETAT ELQARASEYL DRVETEMEAR RLELGVEDAL KDVPGITSKI LVKLGEGDVK TVEDLAGCAT DDLVGWTERK EGAEPVKFAG ILDGVEGVTR DEAEDLIMQA RVKAGWITEE ELASSKGEAA IAETEAEAE
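The gene and protein sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such a string could be cleaned, translated, and checked for GC content. It assumes that the amino acid assignments of translation table 11 match the standard genetic code (the two tables differ in their permitted start codons, not in codon meanings). All function names are hypothetical, and only the first and last 30 nucleotides of the gene sequence are used here.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, ignoring any trailing partial codon."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First and last three blocks of the gene sequence in this record.
first_blocks = "ATGGCCGTCA GCGCCAACAA GCTGGAATTG"
last_blocks = "ATTGCCGAGA CCGAAGCCGA GGCGGAGTGA"

print(translate(first_blocks))  # MAVSANKLEL
print(translate(last_blocks))   # IAETEAEAE*
```

The two translations match the start and end of the protein sequence above, with the trailing `*` corresponding to the TGA stop codon. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.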