Gene RPD_2403 details

Gene Information

Locus tag: RPD_2403
Symbol:
ID: 4022892
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 2676970
End bp: 2679300
Gene Length: 2331 bp
Protein Length: 776 aa
Translation table: 11
GC content: 63%
IMG OID: 637962594
Product: ATP-binding region, ATPase-like
Protein accession: YP_569534
Protein GI: 91976875
COG category: [T] Signal transduction mechanisms
COG ID: [COG4251] Bacteriophytochrome (light-regulated signal transduction histidine kinase)
TIGRFAM ID:
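
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes that Start bp and End bp are inclusive coordinates and that Protein Length does not count the stop codon. Neither convention is stated in the record itself, and the variable names are made up for this example.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2676970
end_bp = 2679300
gene_length_bp = 2331
protein_length_aa = 776

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
codons, remainder = divmod(gene_length_bp, 3)

# Assumption: the final codon is a stop codon and adds no residue.
expected_protein_length = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("length divisible by 3:", remainder == 0)
print("protein length consistent:", expected_protein_length == protein_length_aa)
```

Under those assumptions all three checks agree with the listed values (2331 bp, 777 codons, 776 residues).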


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0308443
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACGAGC GGAGAACGCC AGCGGCGTCC GATCCCACCG CCTTTATCCC GGGCGACGAC 
GTTCTGGCGG CCGACCTCAC CGAATGTGAT CGCGAGCCGA TTCACATCCC GGGCTCGATT
CAGCCTCATG GCATGCTGAT CGCGCTTGCC GAACCCGAGT TACGTCCCGT CTCGGTGAGC
GCCAACATTC AGGCGATGAT CGGAATTCCC CCGGAGACCG CGCTGGCAGA GCCGTTGTCC
GCCTATATGA CCGCGGAGAG TTTCGCGCTG CTACGCGCCG CGCTGTCCGA CGACGACGTC
GCCTCGACCA ACCCGATCCG CCTGGAGTTT TTGACGAAAT CTGCCCCGAT CGCTTTCAAC
GGCATTCTCC ATCGACATGA TGGCCTCTGC ATTCTGGAAC TAGAGCCGCG AGGCGCGACC
GAATCCTCGA GCGAGTTCTT CCGCAGCGTT CGCGCGGCGA TCCGACGGTT GCAGAAGGCG
TCCAGCGTTC TGATGGCCTG CGACATCGCA GCGCGCGAGG TGCGGCGGAT CACCGGGTTC
GACCGCATCA AGATCTATCG ATTTGCAGCG GACTGGAGCG GCCAGGTCAT CGCCGAGGAC
CGCGTCGACA GTATCCCGTC GCTGCTGGAT TTCCACTTCC CCGCCTCTGA TATCCCGGCG
CAGTCGCGCG CGCTCTACAC CAGCAACACC ATCCGGATCA TCCCCGACGT CGCCTATCGC
CCGTCTCCGC TGCTCCCGGA TCGCAATCCG GTGACGGGCG GGCCGATCGA TCTGAGCTAC
GCGGTGCTGC GCAGCGTCTC GCCGGTCCAT GTCGAGTACA TGGTCAACAT GGCGGTGTAC
GGCGCGATGT CGGTGTCGAT CGTCCGTGAG GGCCGCCTTT GGGGAATGAT CTCGTGCCAC
AATACCTTGC CCCGGTTCGT CCCCTTCGAA ATCCGGCAGG CTTGTGAACT GATCGCCCAG
GTCCTGACCT GGCAGATCAG CGTGCTGGAA GAAGCCGAGA TCGTTCAGCA CAGCGTGCAG
GTGCGCGCCA TCCAGAACAA GCTGCTGCAC CAATTCGGCG ACGAGCAGAA CCTGCATGAA
GGGCTGACCC GCATCGGCGA TGAGATGCTC GCGCTGATGA GCGCGTTCGG ATTCGCTCTT
TGCGGATTCG ACGGCATCAC GTCTTTCGGA CGAACGCCGT CGCCGGCCCA GTTGCAGGAC
CTCGCGAACT GGATCGCCCG GAATCAGCCG AGCGGAGCGT TCGAGACGGA CCGGCTCTCG
ACGCTCTACG CCGATGCCTC AAGCTATCGG GAGATCGCCA GCGGCGTGCT TGCGATCCCG
CTGGGCCGGG CATCGACGAG TCTGTTGTTG TGGTTTCGAC CCGAGGTCGC CCAGACCGTC
ACCTGGGGCG GCGATCCGCA CAAGCCGGTG CAGATCGGTC CGCGCGGCCG ACGTCTGCAA
ACCCGCGCGT CGTTCGATGC GTGGCGCGAA GAGGTGCGTG GCCGCGCGAT TCCATGGCGC
AGCCACGAGA TCGCCGCTGC CATCGAGATT CGCGATTTGG TCGTGGACGT CATTCTCGGC
AAGGCGGAGC AATTGGAGAG GGCGAACCGC GAACTATCCC GGAGCAACGA CGAGCTGGAA
TCTTTCGCCT ACGTCGCCGC TCATGACCTC AAGGAGCCGC TCCGCCACAT CGAAGCATTC
GCCGGCCTGC TGAGCGATCT CCTCGCGCCC GAGGCCAAGA CCCGGCTGAA TGTCATGGTC
AACGGAATTG AAGCGTCGTC CCGGCGCCTG CGCGCCCTGA TCAACGATCT AGCGGAGTAT
TCGCGCGTCG GACGACAGGC GCGGCCGCTC GCTCCGATAT CGTTGAACGA GGTTTTGTCC
GAAGTCCTCG CGGACTTGAA GCCGAATTTG CAGGATACCC GCGCGGCGGT GTCGGCGGAC
GATCTCCCAG TGGTGCTGTG TGATGCCAGC CAGATCAGGC AATTGTTGCA GAACCTGATT
TCCAACGCGC TCAAATATCG CGATGCGTTT CGTCCGCCGC AGATCGAGAT CAGTTCTGCG
GTGGATACCG AGACCAAGGA TAGTCACGAT CGTCATCCAC GCGTGAGAGT CACGATCTCG
GATAATGGAA TCGGCTTCGA CCCAAAATAT GCCGAGCAGA TCTTCGAGCC CTTTCAGAGG
CTGCACGGCC CGGATGAATA CGAAGGCACC GGCATCGGGC TGGCGATCTG CCGCAAAATC
GTGAGCCGCC ACGGCGGCTC GATCACTGCG ACGAGCATGC CCGGAAGCGG ATCGGCCTTC
AGCTTCACCC TTTCACTGCG CGGCGCCGAC GATGTGGAGC AGACGTCATG A
 
Protein sequence
MNERRTPAAS DPTAFIPGDD VLAADLTECD REPIHIPGSI QPHGMLIALA EPELRPVSVS 
ANIQAMIGIP PETALAEPLS AYMTAESFAL LRAALSDDDV ASTNPIRLEF LTKSAPIAFN
GILHRHDGLC ILELEPRGAT ESSSEFFRSV RAAIRRLQKA SSVLMACDIA AREVRRITGF
DRIKIYRFAA DWSGQVIAED RVDSIPSLLD FHFPASDIPA QSRALYTSNT IRIIPDVAYR
PSPLLPDRNP VTGGPIDLSY AVLRSVSPVH VEYMVNMAVY GAMSVSIVRE GRLWGMISCH
NTLPRFVPFE IRQACELIAQ VLTWQISVLE EAEIVQHSVQ VRAIQNKLLH QFGDEQNLHE
GLTRIGDEML ALMSAFGFAL CGFDGITSFG RTPSPAQLQD LANWIARNQP SGAFETDRLS
TLYADASSYR EIASGVLAIP LGRASTSLLL WFRPEVAQTV TWGGDPHKPV QIGPRGRRLQ
TRASFDAWRE EVRGRAIPWR SHEIAAAIEI RDLVVDVILG KAEQLERANR ELSRSNDELE
SFAYVAAHDL KEPLRHIEAF AGLLSDLLAP EAKTRLNVMV NGIEASSRRL RALINDLAEY
SRVGRQARPL APISLNEVLS EVLADLKPNL QDTRAAVSAD DLPVVLCDAS QIRQLLQNLI
SNALKYRDAF RPPQIEISSA VDTETKDSHD RHPRVRVTIS DNGIGFDPKY AEQIFEPFQR
LHGPDEYEGT GIGLAICRKI VSRHGGSITA TSMPGSGSAF SFTLSLRGAD DVEQTS
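
Both sequences are displayed in blocks of 10 characters, six blocks per line, so the spaces and line breaks have to be removed before the sequence is used elsewhere. The following is a minimal sketch of that step, using only the last line of the gene sequence as input. The helper name `ungroup` is hypothetical and not part of any database tooling.

```python
def ungroup(blocks: str) -> str:
    """Join whitespace-separated sequence blocks into one continuous string."""
    return "".join(blocks.split())


# Last displayed line of the gene sequence in this record.
last_line = "AGCTTCACCC TTTCACTGCG CGGCGCCGAC GATGTGGAGC AGACGTCATG A"
tail = ungroup(last_line)

# The gene sequence has 38 full lines of 60 bases followed by this final line.
total_bases = 38 * 60 + len(tail)

print(len(tail))     # 51
print(tail[-3:])     # TGA
print(total_bases)   # 2331
```

The total matches the listed gene length of 2331 bp, and the final codon TGA is a stop codon under translation table 11.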