Gene RPD_3788 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_3788 
Symbol 
ID4024304 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp4227536 
End bp4229854 
Gene Length2319 bp 
Protein Length772 aa 
Translation table11 
GC content66% 
IMG OID637963992 
Productphytochrome 
Protein accessionYP_570910 
Protein GI91978251 
COG category[T] Signal transduction mechanisms 
COG ID[COG4251] Bacteriophytochrome (light-regulated signal transduction histidine kinase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0317187 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACTCGG GCGTTGACAA TTTAGCCGAG TTGAAATTCA GCGCAATCGG GCCGACCGCG 
TGCGACCGCG AGGCGATTCA TCTCGCCGGC TCGATCCAGC CGCACGGCGC TTTGCTCGCG
GTGACGCCAG AGGATCTGCA GATCGTCCAT GCCGGCGGCG AAACTGCAGC GCTGTTGGGC
GTTTCCATCG AATCGCTGGC TGGAGTCTCT GCGTTCACGA TTTTCTCGGG CGACCAGATC
GCGCGGCTGC GCGCCCTGTT GGGCCCCGAT CGCAAGATTG AACGGCCTCT GCACGCCTTC
ACGCTGAAAG CCGCCGATGC GACACCGGTC GACGTCGTGG TTCACCAGGC CTCCGGCCTG
CTCGTGCTGG AGTTCGAGCC ACGCCGCGAG CCAGCTCCAA ACAATTCGCT CGCGCTCGTG
CAATCGATGA TCCGTCACGT GCAGCGCGCC GGAACCGTGC AGGCGTTTTG CGACGCGGTG
GTCGCCGAGG TTCGCGCCGT CACCGGCTTC GATCGGGTGA TGATCTATCG CTTCATGCCG
GATTGCAGTG GCGAGGTGAA AGCGGAGTCG CGCGCGTCCG GAATGGAGAG CTTCCTCGGG
CTGCGGTATC CGGAGTCCGA CATCCCGAAG CAGGCCCGCG CGCTGTATCT CGCGAACTGG
ATCCGCGCGA TCCCCGACGC CCGCTACGCG CCGGCGCGGA TCGTGCCTAT GATCGATCCT
CGCACCGGAC TGCCGCTCGA TCTAAGCCAG AGCGTGATCC GCAGCGTATC GCCGGCGCAC
CGGCTGTATC TCGCCCATAT GGGCGTGGTG GCTTCAATGT CGCTTTCGAT CATTCAGCAT
GGCAGGCTGT GGGGCCTGAT TGCCTGCCAT CACAGCTCGC CGCGCTATCT ACCCTATCGG
ATGCGGGAGG CGTGCGAGCT TTTCGCCGAG ATGGCGTCAT CGCAGCTCGA GGCCAAGGTC
GCCGCCGAAC AACTCGAGGC ACGGCTGCGC AGCACGCGGA TTCACGAAGA GCTGGTGACG
CGGATGAGCC AGGAATCCGA CCTCGCCGAG GGCCTGATCA GGTTTCATCC CAATCTGCTC
GACTTCATTC CGGCGACCGG CGTCGGATTG TGGGTCGACG GTCAATTCAC CGGCCTCGGC
GTCACGCCCG ATGCCGCGCA AACCGAAGCA CTGATCGGCT GGCTGACGGC CACTGCGAAT
GACGGGGTAT TTCAAACCGA CGCCCTGCCG CTGATCTATC CGCCGGCAAA AGCCTTTGCC
GATTGCGCCA GCGGCCTGAT GGCGCTGTCG CTGTCGAAAT CGCCACGCGA CTACGTGCTG
TGGTTCCGGC CCGAAGTGGT GCGCACCGTC ACCTGGGCGG GCAATCCAAA CAAGCCGGTC
GGCGTCGGTC CCGATGGCGG CTTCACGAAC CCGCGCCGCA GCTTCGCCGC ATGGCAGGAA
TCGGTGCGGC TGCATTCCGA GCCGTGGCGC GCCTCCGACA TCGAAGCCGC CCACCGGCTG
AGGCTGTCGC TGCTGGAAGT GGTGCTGCGG CGGATCGACG GCATCGCCCG CGAGCGCAAA
TCGGCGCGGC TGCTGCAGGA GCAACTGATG CGGCAGGTCG AGATCGGCCT GCGCCGGTCG
CAGGACGTCG CCAAGACGCT GCGGGAAGAG ACCCGCCGGC GGGTCTCGGT CGAGGCCGAC
CTGTCGCAGG TGCTGCGCCG GACGGTCGAG GATCAGGAAG CCGAGCGGCT GCGAATCGCG
CGCGAACTGC ATGACACGCT CGGCCAGTCG CTGACGCTGC TGCAGCTCGG CTTCGAGAAT
CTCGGGCAGG TCGCACCCGA CAATGGCGAA TTGCAGCGCC GCATCGCCGG CATCAAGACC
CTCACGGCCG AGATCGGCCA ACAGGTCAAC CGGCTGGCCT GGGAAATCCG GCCGACCGCG
CTCGACGATC TCGGGATCCA GACCGCGGTC CAGCATCTGC TCGACGCATG GAGCGAGAAG
TCGCAAGTGC AGTTCGATCT GCACATGACG CTCGGCGACC GCCGCCTGCC GCCCGCGATC
GAGACCACTC TTTATCGCGT GCTGCAGGAA GCGCTGACCA ACATCGTCCG CCACGCCGCC
GCGAGCCATG TCAGCGTCAT TTTGCGGCTG TCGGATCGGC AGGTGACGAT GGTGGTCGAG
GACGACGGCC GTGGCTTCGT CAATCCCGAC GCCGCCCGCC CGCCGGAGCG GCTCGGCCTG
CTCGGCATTC GTGAGCGGCT GACGCTGGTC GGCGGCTCGC TCGAAATCGA ATCGGCGCCC
GGCAGGGGCA CCGCTTTGTT CGCTCGAATT CCGTTGTAA
 
Protein sequence
MHSGVDNLAE LKFSAIGPTA CDREAIHLAG SIQPHGALLA VTPEDLQIVH AGGETAALLG 
VSIESLAGVS AFTIFSGDQI ARLRALLGPD RKIERPLHAF TLKAADATPV DVVVHQASGL
LVLEFEPRRE PAPNNSLALV QSMIRHVQRA GTVQAFCDAV VAEVRAVTGF DRVMIYRFMP
DCSGEVKAES RASGMESFLG LRYPESDIPK QARALYLANW IRAIPDARYA PARIVPMIDP
RTGLPLDLSQ SVIRSVSPAH RLYLAHMGVV ASMSLSIIQH GRLWGLIACH HSSPRYLPYR
MREACELFAE MASSQLEAKV AAEQLEARLR STRIHEELVT RMSQESDLAE GLIRFHPNLL
DFIPATGVGL WVDGQFTGLG VTPDAAQTEA LIGWLTATAN DGVFQTDALP LIYPPAKAFA
DCASGLMALS LSKSPRDYVL WFRPEVVRTV TWAGNPNKPV GVGPDGGFTN PRRSFAAWQE
SVRLHSEPWR ASDIEAAHRL RLSLLEVVLR RIDGIARERK SARLLQEQLM RQVEIGLRRS
QDVAKTLREE TRRRVSVEAD LSQVLRRTVE DQEAERLRIA RELHDTLGQS LTLLQLGFEN
LGQVAPDNGE LQRRIAGIKT LTAEIGQQVN RLAWEIRPTA LDDLGIQTAV QHLLDAWSEK
SQVQFDLHMT LGDRRLPPAI ETTLYRVLQE ALTNIVRHAA ASHVSVILRL SDRQVTMVVE
DDGRGFVNPD AARPPERLGL LGIRERLTLV GGSLEIESAP GRGTALFARI PL