Gene RPD_2589 details

Gene Information

Locus tag: RPD_2589
Symbol:
ID: 4023085
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 2901035
End bp: 2902207
Gene Length: 1173 bp
Protein Length: 390 aa
Translation table: 11
GC content: 63%
IMG OID: 637962786
Product: signal transduction histidine kinase, nitrogen specific, NtrB
Protein accession: YP_569719
Protein GI: 91977060
COG category: [T] Signal transduction mechanisms
COG ID: [COG3852] Signal transduction histidine kinase, nitrogen specific
TIGRFAM ID: [TIGR00229] PAS domain S-box
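
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length(2901035, 2902207)
print(length)                  # 1173
print(protein_length(length))  # 390
```

Both results agree with the Gene Length and Protein Length fields of the record.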


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.206184
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0145748
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGTCG TCCTTGAACA GCGCCACGCC AAGCCGACCG AAAGCGATGC GATCCTCAAC 
GCGCTGCCCA ATCCCGTGCT GCTGATCGGG CCGGACGGCA AGATCATCGA TGCCAACATG
GCGGCGGAAT CGTTCTTCGA GATTTCGACG CAGCTCTTAC GGCGGCAATC ACTGACCGAG
CTGGTGCCGT TCGGCAGTCC GCTCCTGGCG CTGATTGACC AGGTCCGCAG CGGCAATTCG
CCGGTCAACG AGTACAAGGT CGATCTCGGC ACGCCGCGGA TCGGTTCCGA TCGCCAGGTC
GATCTGCACG TCGCGCCGCT GAACGAGCGT CCGGGGCATA TTGTCGTGAT GCTGCAGGAG
CGTACCATCG CGGACAAGAT GGACCGGCAG CTCACCCATC GCAGCGCCGC GCGCTCGGTG
ATCGCGCTGG CGGCGATGCT CGCGCACGAG ATCAAAAACC CGCTGTCCGG CATCCGCGGC
GCGGCGCAAT TGCTCGAGCA GCAGGCGTCG TCGGAAGACC GGATGCTGAC GCGGCTGATC
TGCGACGAGG CCGACCGCAT CGTCACCCTG GTCGATCGCA TGGAAGTGTT CGGCGACGAC
CGCCCGGTGG CGCGCGGGCC GGTCAACATT CATTCCGTGT TCGATCACGT CAAACGGCTG
GCGCAGTCCG GCTTCGCACG CAACATCAAA TTCGTCGAGG ACTACGACCC GTCGCTGCCG
CCGGTGCTCG CCAATCAGGA TCAGCTGATT CAGGTGTTTC TCAACCTCGT GAAGAACGCC
GCCGAAGCCG TTGTCGATCT CGGGAGCGAC GCCGAGATTC ATCTCACGAC CGCGTTTCGT
CCCGGCGTGC GGCTGTCGGT GCCGGGCAAA AAGACTCGTG TGTCACTGCC GCTGGAATTC
TGCGTCAAGG ACAACGGTCC CGGCGTGCCG GAAGACCTAT TGCCGAATCT GTTCGATCCG
TTCGTCACCA CCAAGGCGTC GGGATCCGGG CTCGGGCTCG CGCTGGTCGC CAAGATCGTC
GGCGATCACG GCGGAATCAT CGAGTGTGAA TCGCAGCCAC GCAAGACCTC GTTCCGCGTG
CTGCTGCCGA TGTTCAGCAC GGCGAAGAAC GGCAATCAAA GCAACGGCGA GGACGTGCCG
GCGTCATCCC ATGCCTCTCA GACTGCAAGA TGA
 
Protein sequence
MTVVLEQRHA KPTESDAILN ALPNPVLLIG PDGKIIDANM AAESFFEIST QLLRRQSLTE 
LVPFGSPLLA LIDQVRSGNS PVNEYKVDLG TPRIGSDRQV DLHVAPLNER PGHIVVMLQE
RTIADKMDRQ LTHRSAARSV IALAAMLAHE IKNPLSGIRG AAQLLEQQAS SEDRMLTRLI
CDEADRIVTL VDRMEVFGDD RPVARGPVNI HSVFDHVKRL AQSGFARNIK FVEDYDPSLP
PVLANQDQLI QVFLNLVKNA AEAVVDLGSD AEIHLTTAFR PGVRLSVPGK KTRVSLPLEF
CVKDNGPGVP EDLLPNLFDP FVTTKASGSG LGLALVAKIV GDHGGIIECE SQPRKTSFRV
LLPMFSTAKN GNQSNGEDVP ASSHASQTAR
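
The gene and protein sequences above are printed in blocks of ten characters, so the whitespace has to be removed before they can be compared. The sketch below shows one way to do that, to translate the DNA, and to compute the GC fraction. It assumes that the amino-acid assignments of translation table 11 match the standard code (table 11 differs in its permitted start codons, which this sketch does not model); all function names are hypothetical.

```python
# Codon table in NCBI order, with bases indexed T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        c1, c2, c3 = dna[i:i + 3]
        index = 16 * BASES.index(c1) + 4 * BASES.index(c2) + BASES.index(c3)
        aa = AMINO[index]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


# First three blocks and last line of the gene sequence in this record.
print(translate("ATGACCGTCG TCCTTGAACA GCGCCACGCC"))       # MTVVLEQRHA
print(translate("GCGTCATCCC ATGCCTCTCA GACTGCAAGA TGA"))   # ASSHASQTAR
```

The two outputs match the first and last ten residues of the protein sequence in the record. Passing the full gene sequence to `gc_fraction` would give a value to compare with the GC content field.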