Gene RPD_0233 details


Gene Information

Locus tag: RPD_0233
Symbol: nusA
ID: 4020691
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 270895
End bp: 272517
Gene Length: 1623 bp
Protein Length: 540 aa
Translation table: 11
GC content: 66%
IMG OID: 637960412
Product: transcription elongation factor NusA
Protein accession: YP_567374
Protein GI: 91974715
COG category: [K] Transcription
COG ID: [COG0195] Transcription elongation factor
TIGRFAM ID: [TIGR01953] transcription termination factor NusA
            [TIGR01954] transcription termination factor NusA, C-terminal duplication
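
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 270895
end_bp = 272517
gene_length_bp = 1623
protein_length_aa = 540

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both the start and the end position.
span_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted
# in the protein length.
codons = span_bp // 3
expected_protein_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("span is a whole number of codons:", span_bp % 3 == 0)
print("protein length is consistent:", expected_protein_aa == protein_length_aa)
```

Under these assumptions all three checks print True for this record.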


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.157572
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCGTCA GCGCCAACAA GCTTGAATTG CTGCAGATCG CCGACGCGGT AGCGCGGGAG 
AAATCGATCG ACCGCGGCAT CGTGATCGCG GCGATGGAAG ACGCGATCGC CAAGGCCGCG
CGCGCCCGCT ACGGCTCGGA GACCGACGTT CACGCCGAGA TCGACGCCAA GAAGGGCGAG
CTGCGGCTGT CGCGCCACAT GCTGGTCGTC GATAAGGTCG AGAACGCCGC CAACCAGATT
TCGCTGAAGG ACGCGCAGCG CGCCAATCCC GGCGCGCAGA TCGGCGACAC CATCGCCGAC
ACCCTGCCGC CGCTGGAATA CGGCCGCATC GCCGCGCAGT CGGCCAAGCA GGTGATCGTG
CAGAAGGTGC GCGAGGCCGA GCGCGACCGG CAATACATGG AGTTCAAGGA CCGCATCGGC
GACGTCGTCA ACGGTGTCGT CAAGCGCGTC GAATACGGCA GCGTGATCGT CGATCTCGGC
CGCGGCGAGG CGATCGTGCG CCGCGACGAG ATGCTGCCGC GCGAATCGTT CCGCAACGGC
GACCGCGTCC GCGCCTACAT CTTCGACGTT CGCCGCGAGA CCCGCGGCCC GCAGATCTTC
CTGTCGCGCA CCCACCCGCA ATTCATGGCG AAGCTGTTTC AGCAGGAAGT GCCGGAAATC
TACGACGGCA TCGTCGAGAT CAAGGCGGTC GCCCGCGATC CCGGCTCGCG CGCCAAAATC
GGCGTGGTGT CGCGCGACAG CTCGGTCGAT CCGGTCGGCG CCTGCGTCGG TATGCGCGGT
TCGCGCGTCC AGGCGGTGGT CAACGAGCTG CAGGGCGAGA AGATCGACAT CATCCCGTGG
TCGCCCGACA TCGCGACCTT CGTGGTCAAC GCGCTGGCCC CGGCGGAAGT CTCGAAAGTC
GTGATCGACG AAGACCGCGA GCGGATCGAG GTTGTCGTTC CGGACACCAA TAACCAACTA
TCCCTTGCGA TCGGTCGTCG CGGCCAGAAC GTCCGTCTGG CGTCGCAGCT CACCGGCTGG
GACATCGACA TTCTGACCGA GACCGAGGAA TCCGAGCGCC GCCAGGCCGA TTTCGAGAAT
TCGACCCGGG TGTTCATGGA AGCGTTGAAC GTCGACGAAG TGGTCGGCCA GCTGCTCGCC
TCCGAAGGCT TCACCTCGGT CGAGGAACTG GCGATGGTCG ATATCCGCGA ACTGGCCTCG
ATCGAAGGTT TCGACGAGGA GACCGCGACC GAATTGCAGG CTCGCGCCGC CGAATATCTC
GACCGCGTCG AGACCGAGCT GGAAGCGCGG CGGCAGGAAC TCGGCGTCGA GGACGCGCTC
AAGGACGTCC CCGGCGTTAC CTCGAAGATG CTGGTCAAGC TCGGCGAGGG CGACGTCAAG
ACGGTCGAGG ATCTGGCCGG CTGCGCCACC GACGATCTGG TCGGCTGGAC CGAGCGCAAG
GAAGGCGCCG AGCCGGTGAA GTATGCTGGC ATTCTCGACG GCGTCGAGAT GACGCGCGAC
GACGCCGAAC ATCTGATCAT GCAGGCCCGC GTCAAGGCCG GCTGGATCAC CGAGGAAGAA
CTCGCCCAGA CTGCCGACAA GGGCGAGGAC GCCGGTGCGG AGACCGAAGG CGCGGCGGAG
TAA
 
Protein sequence
MAVSANKLEL LQIADAVARE KSIDRGIVIA AMEDAIAKAA RARYGSETDV HAEIDAKKGE 
LRLSRHMLVV DKVENAANQI SLKDAQRANP GAQIGDTIAD TLPPLEYGRI AAQSAKQVIV
QKVREAERDR QYMEFKDRIG DVVNGVVKRV EYGSVIVDLG RGEAIVRRDE MLPRESFRNG
DRVRAYIFDV RRETRGPQIF LSRTHPQFMA KLFQQEVPEI YDGIVEIKAV ARDPGSRAKI
GVVSRDSSVD PVGACVGMRG SRVQAVVNEL QGEKIDIIPW SPDIATFVVN ALAPAEVSKV
VIDEDRERIE VVVPDTNNQL SLAIGRRGQN VRLASQLTGW DIDILTETEE SERRQADFEN
STRVFMEALN VDEVVGQLLA SEGFTSVEEL AMVDIRELAS IEGFDEETAT ELQARAAEYL
DRVETELEAR RQELGVEDAL KDVPGVTSKM LVKLGEGDVK TVEDLAGCAT DDLVGWTERK
EGAEPVKYAG ILDGVEMTRD DAEHLIMQAR VKAGWITEEE LAQTADKGED AGAETEGAAE
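
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the standard codon assignments, which translation table 11 shares for elongation (table 11 differs in which codons may act as start codons; this gene starts with ATG, so that does not matter here). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical and introduced only for this example.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered by first, second, and third base
# in T, C, A, G order. "*" marks a stop codon.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq_text):
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq_text):
    """Return the fraction of bases that are G or C."""
    seq = clean(seq_text)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq_text):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq_text)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First row of the gene sequence listed above.
first_row = "ATGGCCGTCA GCGCCAACAA GCTTGAATTG CTGCAGATCG CCGACGCGGT AGCGCGGGAG"
print(translate(first_row))  # MAVSANKLELLQIADAVARE
```

The 20 residues printed for the first row match the first two blocks of the protein sequence. Passing the full gene sequence text to `gc_fraction` gives a value that can be compared with the GC content field.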