Gene RPB_3985 details


Gene Information

Locus tag: RPB_3985
Symbol:
ID: 3911792
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 4551395
End bp: 4553593
Gene Length: 2199 bp
Protein Length: 732 aa
Translation table: 11
GC content: 65%
IMG OID: 637885889
Product: putative PAS/PAC sensor protein
Protein accession: YP_487589
Protein GI: 86751093
COG category: [T] Signal transduction mechanisms
COG ID: [COG4251] Bacteriophytochrome (light-regulated signal transduction histidine kinase)
TIGRFAM ID: [TIGR00229] PAS domain S-box
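
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Neither assumption is stated in the record itself, and the variable names are made up for this example.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4551395
end_bp = 4553593
reported_gene_length = 2199      # bp
reported_protein_length = 732    # aa

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values agree with the reported Gene Length and Protein Length, and the gene length is a whole number of codons.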


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.525883
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.142655
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACGGCTG AAGCGACGAT CTCCCCCGCA TTCGGAACCG CCGATCTTTC GAACTGCGAG 
CAGGAGCAAA TTCATCTTGC CGGTTCGATT CAGCCGCACG GCGCTCTGCT GGTGATCAGC
GAGCCGGATC ACCGTATCGT TCAAGCCAGC GCCAACGCCG CCGAGTTCCT CAACGTCGAG
CGGGTGCTCG GTCTGCCGCT CGCCGAACTC GAAGGCGATC TGCTGATCCG TATCCTGCCT
CATCTCGACC CGACTTCCGA GGGAACGCCG ATCGCCGTGC GCTGCCGGAT CGGCAGCCCG
GGGGCGGATT TCGACGGCTT GATCCATCGG CCGCTGGAAG GGGGGCTGAT CGTCGAGCTG
GAGCGCGCCG GCCCGCCGGT CGATCTGTCG CTGATGGTCG GGCAGGCGCT GGACAAGATC
CGGACCGCGA GCTCGGTTCG CGCCTTGTGC GAAGAAGCTG CGGTGCTGTT CCAGAATCGC
ACCGGCTACG ACCGGGTGAT GATCTACCGT TTCGACGAAG AGGGCCACGG CGAGGTGTTC
TCCGAGCGCC ACGTCCCGGG GCTCGAATCC TATCTCAGCA ACCGTTATCC GTCCTCCGAC
ATTCCGCAAA TGGCGCGACG CTTGTACGAG CGAATTCGCG TGCGCGTGCT GGTCGACGTC
GGCTACGAGC CGGTGCCCCT GCAGCCGCGG CTGTCACCTC TCACCGGGCG CGACCTCGAC
ATGTCGAGCT GCTTTCTCCG CTCGATGTCG CCGATCCATC TGCAGTATCT GAAGAACATG
GGCGTCCGCG CGACGCTGGT GGTGTCGCTG GTGGTCGGCG GCAAACTGTG GGGACTGGTC
GCCTGCCATC ACTATCAGCC GCGCTTCATC CATTTCGAGC TGCGCGCCGT CTGCGAGCTG
CTCGCCGAGG CGATCGCCAC GCGGATCACC GCACTCGAGA GCTTTGCGCA AAGCCAGTCC
GAACTGTTCG TGCAGCGGCT CGAGCAACGC ATGATCGAAG CGATCTCGCG GGAGGGCGAC
TGGCGCGCGG CGATCTTCGA CACGACGCAG TCGATCCTGC CGCCGGTCGG TGCGACTGGT
GGCGCGCTGG TGTACGAGGG GCAGGTCACC ACGATCGGTG AAGTGCCAGG CACCCAGGAT
ATTCGCGAGG TCGCCGCCTG GCTCAGCCGC CAGCCCCGCG TGTCGGTGAT CTCGACATCG
TCGCTCGGTC TCGATGCGCC GGAATTCGCG CCGTTGACGC GCGTCGCGAG CGGCGTCGCG
GCGGCGCCGG TCTCCAACCA TCCGGGCGAA TTCCTCATCT GGTTCCGTCC CGAGCGTGTG
CGCACGGTGA CCTGGGGCGG CGATCCGAAA AAGCCTTTCG TGATCGGAGA TACGCCGGCT
GATCTTTCCC CCCGGCGCTC GTTCGCCAAA TGGAATCAGG TGGTCGAAGG GACTTCCGAT
CCGTGGACGC CGGCCGACCT GGCAGCGGTG CGGATGATCG GCCAGTCTGT CGCCGACATC
GTCTTGCAGT TCCGCGCCGT ACGAACGCTG ATCGCCCAGG AGCAACTCGA CCAGTTCTCC
CGTCAGGTGC TGACCTCCGA CCATCCGGTG GTGATCGCCG ACATCGAAGG CAAGATCCTG
CTGATGAACG ACGCGTTCAC ATCGATGCTG CCGCAGGCGC ATCCGCCCGT CCAGCGACTG
GACGATCTCG CCTCGGCCTT CGTCGAACCC TACGATTTCC TGCGGAATGT CGGCGAGCTG
ATCAGTCGAC GGCGCGGCTG GCGGGGCGAA TTGCTGTTGC GCGGCGGGGG AAACGAGCCG
CGTCCGTTGA TGGTGCGGGC CGACCCGGTG GTCCCGTCGC GCGATCGCGC GCTCGGCTTC
GTGCTGATCT TCACCGACAA CACCGACCGC CGCGCTGCTG AAGCGGCGCG TGGCCGTTTC
CAGGAAGGCA TCATCAAAAG CAGTCGGGTC GGCAGCGTGC GGTTGGATTC GAAATCCGAC
CTCGTCTATC AGAATCTGTT GTCGTCGCTG GTCGAGAACG CCCAGCTCGC GGCGCTGGAG
ATCACCTATG GGGTCGAGAC CGGGCGGATT GCCGAAATGC TCGAAGGCGT GCGCAATTCG
ACGCTGCGCA CTGCGGAAGT CCTCGAGTAT TTGATGCTGC ACGCGTCCCG CACCAGCGGG
AACGACAACA CGACAAGAAA CAACAAGTCC AACAGCTAG
 
Protein sequence
MTAEATISPA FGTADLSNCE QEQIHLAGSI QPHGALLVIS EPDHRIVQAS ANAAEFLNVE 
RVLGLPLAEL EGDLLIRILP HLDPTSEGTP IAVRCRIGSP GADFDGLIHR PLEGGLIVEL
ERAGPPVDLS LMVGQALDKI RTASSVRALC EEAAVLFQNR TGYDRVMIYR FDEEGHGEVF
SERHVPGLES YLSNRYPSSD IPQMARRLYE RIRVRVLVDV GYEPVPLQPR LSPLTGRDLD
MSSCFLRSMS PIHLQYLKNM GVRATLVVSL VVGGKLWGLV ACHHYQPRFI HFELRAVCEL
LAEAIATRIT ALESFAQSQS ELFVQRLEQR MIEAISREGD WRAAIFDTTQ SILPPVGATG
GALVYEGQVT TIGEVPGTQD IREVAAWLSR QPRVSVISTS SLGLDAPEFA PLTRVASGVA
AAPVSNHPGE FLIWFRPERV RTVTWGGDPK KPFVIGDTPA DLSPRRSFAK WNQVVEGTSD
PWTPADLAAV RMIGQSVADI VLQFRAVRTL IAQEQLDQFS RQVLTSDHPV VIADIEGKIL
LMNDAFTSML PQAHPPVQRL DDLASAFVEP YDFLRNVGEL ISRRRGWRGE LLLRGGGNEP
RPLMVRADPV VPSRDRALGF VLIFTDNTDR RAAEAARGRF QEGIIKSSRV GSVRLDSKSD
LVYQNLLSSL VENAQLAALE ITYGVETGRI AEMLEGVRNS TLRTAEVLEY LMLHASRTSG
NDNTTRNNKS NS
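
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It builds the codon table from the amino-acid string NCBI publishes for translation table 11 (bases ordered T, C, A, G). The function name `translate_cds` is hypothetical. The sketch assumes the input is a complete CDS on the coding strand, and it simply forces the first residue to M (in table 11, GTG is one of the recognized alternative start codons) rather than checking the start codon against the table's start list.

```python
# Codon table for NCBI translation table 11, bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence; spaces and line breaks are ignored."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3          # drop any trailing partial codon
    codons = [dna[i:i + 3] for i in range(0, usable, 3)]
    residues = [CODON_TABLE[codon] for codon in codons]
    if residues:
        # Simplifying assumption: the first codon is the start codon,
        # which is read as methionine whatever it encodes internally.
        residues[0] = "M"
    # Stop at the first stop codon ('*'), if any.
    return "".join(residues).split("*")[0]


# First 30 bases of the gene sequence listed above.
first_block = "GTGACGGCTG AAGCGACGAT CTCCCCCGCA"
print(translate_cds(first_block))
```

For the first three 10-base blocks of the gene this yields MTAEATISPA, matching the first block of the listed protein sequence. Passing the full gene sequence to the same function would be the natural next check.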