Gene RPB_0598 details

Gene Information

Locus tag: RPB_0598
Symbol: nusA
ID: 3908291
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 670952
End bp: 672571
Gene Length: 1620 bp
Protein Length: 539 aa
Translation table: 11
GC content: 66%
IMG OID: 637882487
Product: transcription elongation factor NusA
Protein accession: YP_484220
Protein GI: 86747724
COG category: [K] Transcription
COG ID: [COG0195] Transcription elongation factor
TIGRFAM ID: [TIGR01953] transcription termination factor NusA
            [TIGR01954] transcription termination factor NusA, C-terminal duplication
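
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes the start and end coordinates are 1-based and inclusive, and that the protein length excludes the stop codon. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 670952
end_bp = 672571
gene_length_bp = 1620
protein_length_aa = 539

# Assumption: coordinates are 1-based and inclusive.
span = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon that is not counted
# in the protein length.
codons = gene_length_bp // 3

print("span matches gene length:", span == gene_length_bp)
print("length is a whole number of codons:", gene_length_bp % 3 == 0)
print("codons minus stop matches protein length:", codons - 1 == protein_length_aa)
```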


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCGTCA GCGCCAACAA GCTGGAATTG CTGCAGATCG CCGACGCGGT GGCGCGGGAG 
AAATCGATCG ACCGCGGCAT CGTGATCGCC GCGATGGAAG ACGCGATCGC GAAGGCGGCG
CGCGCCCGCT ACGGCTCGGA GACCGACGTC CACGCCGAGA TCGACGCCAA GAAGGGCGAA
TTGCGGCTGT CGCGCCATAT GCTGGTGGTC GAAAACGTCG AGAACCCCGC CAACCAGATC
TCGCTGAAGG CCGCGCAGCG CGCCAACCCC GGCGCGCAGA TCGGCGACAC CATCGCCGAC
ACGCTGCCGC CGCTGGAATA CGGCCGCATC GCCGCGCAGT CGGCCAAGCA GGTGATCGTG
CAGAAGGTGC GCGAGGCCGA GCGTGACCGG CAATACTCGG AATTCAAGGA TCGCATCGGC
GACATCGTCA ACGGCGTCGT CAAGCGCGTC GAATACGGCA GCGTGATCGT CGATCTCGGC
CGCGGCGAGG CGATCGTGCG CCGCGACGAG ATGCTGCCGC GCGAATCGTT CCGCAACGGC
GACCGCGTCC GCGCCTACAT CTTCGACGTC CGCCGCGAGA CCCGCGGCCC GCAGATCTTC
CTGTCGCGCA CTCATCCGCA GTTCATGGCC AAGCTGTTCG CCCAGGAAGT GCCGGAAATC
TACGACGGCA TCGTCGAGAT CAAGGCGGTC GCCCGCGATC CGGGCTCGCG CGCCAAGATC
GGCGTGGTGT CGCGGGATTC CTCGGTCGAT CCGGTCGGCG CCTGCGTCGG CATGCGCGGC
TCGCGCGTTC AGGCCGTGGT CAACGAGCTG CAGGGCGAGA AGATCGACAT CATCCCGTGG
TCGCCGGACA TCGCCACTTT CGTGGTCAAC GCGCTGGCGC CGGCCGAAGT CTCGAAGGTC
GTGATCGACG AAGATCGCGA GCGCATCGAG GTTGTGGTTC CCGACACCAA TAACCAATTA
TCCCTTGCGA TCGGCCGTCG CGGCCAGAAC GTTCGTTTGG CCTCGCAGCT CACCGGCTGG
GACATCGACA TCCTGACCGA GACCGAGGAA TCCGAGCGCC GCCAGGCCGA TTTCGAGAAT
TCGACCCGGG TGTTCATGGA AGCGCTGAAC GTCGACGAAG TGGTCGGCCA GCTGCTCGCC
TCCGAGGGGT TCACCTCGGT CGAGGAACTG GCGCTGGTCG ATATCCGCGA ACTGGCGTCG
ATCGAGGGTT TCGACGAGGA AACCGCGACC GAGCTGCAGG CCCGCGCCAG CGAATATCTC
GATCGGGTGG AGACGGAAAT GGAGGCGCGG CGCCTGGAAC TCGGCGTCGA GGACGCCCTC
AAGGACGTCC CCGGCATCAC CTCGAAGATT CTGGTCAAGC TCGGCGAGGG CGACGTCAAG
ACGGTCGAGG ATCTGGCCGG CTGCGCCACC GACGATCTGG TCGGCTGGAC CGAGCGCAAG
GAAGGCGCCG AGCCGGTGAA GTTCGCCGGC ATTCTCGACG GCGTCGAGGG CGTCACGCGC
GACGAGGCCG AAGACCTGAT CATGCAGGCC CGCGTCAAGG CCGGCTGGAT CACCGAGGAG
GAACTCGCCA GCAGCAAGGG CGAGGCCGCC ATTGCCGAGA CCGAAGCCGA GGCGGAGTGA
 
Protein sequence
MAVSANKLEL LQIADAVARE KSIDRGIVIA AMEDAIAKAA RARYGSETDV HAEIDAKKGE 
LRLSRHMLVV ENVENPANQI SLKAAQRANP GAQIGDTIAD TLPPLEYGRI AAQSAKQVIV
QKVREAERDR QYSEFKDRIG DIVNGVVKRV EYGSVIVDLG RGEAIVRRDE MLPRESFRNG
DRVRAYIFDV RRETRGPQIF LSRTHPQFMA KLFAQEVPEI YDGIVEIKAV ARDPGSRAKI
GVVSRDSSVD PVGACVGMRG SRVQAVVNEL QGEKIDIIPW SPDIATFVVN ALAPAEVSKV
VIDEDRERIE VVVPDTNNQL SLAIGRRGQN VRLASQLTGW DIDILTETEE SERRQADFEN
STRVFMEALN VDEVVGQLLA SEGFTSVEEL ALVDIRELAS IEGFDEETAT ELQARASEYL
DRVETEMEAR RLELGVEDAL KDVPGITSKI LVKLGEGDVK TVEDLAGCAT DDLVGWTERK
EGAEPVKFAG ILDGVEGVTR DEAEDLIMQA RVKAGWITEE ELASSKGEAA IAETEAEAE
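
As a consistency check between the two sequences, the first 60 bases of the gene sequence can be translated and compared with the first 20 residues of the protein sequence. This is a minimal sketch, not the database's own method. It builds the standard codon table by hand; translation table 11 assigns the same amino acids to codons as the standard table and differs only in which codons may act as start codons. The names `codon_table` and `translate` are hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
# Table 11 uses the same codon-to-amino-acid assignments.
bases = "TCAG"
aminos = ("FFLLSSSSYY**CC*W"
          "LLLLPPPPHHQQRRRR"
          "IIIMTTTTNNKKSSRR"
          "VVVVAAAADDEEGGGG")
codon_table = {"".join(c): a for c, a in zip(product(bases, repeat=3), aminos)}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, ignoring spaces."""
    dna = dna.replace(" ", "").upper()
    return "".join(codon_table[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


# First line of the gene sequence, copied from this record.
first_line = "ATGGCCGTCA GCGCCAACAA GCTGGAATTG CTGCAGATCG CCGACGCGGT GGCGCGGGAG"

# First two blocks of the protein sequence, copied from this record.
expected = "MAVSANKLEL LQIADAVARE".replace(" ", "")

translated = translate(first_line)
print(translated)
print("matches protein sequence:", translated == expected)
```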