Gene Rsph17029_3976 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_3976 
Symbol 
ID4898813 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009050 
Strand
Start bp1116535 
End bp1117929 
Gene Length1395 bp 
Protein Length464 aa 
Translation table11 
GC content72% 
IMG OID640114579 
Productintegral membrane sensor signal transduction histidine kinase 
Protein accessionYP_001045826 
Protein GI126464713 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.371255 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCTGC GCGCCTCTCC CGATCCGCAG GCGGGGATCT CGCGCCTGTC CGGCTCCTCC 
TCGCTGCGGC TGGCGGCGCG GCTTGCGCTG ATCTTCGTGG GGGCCACGCT TCTGGCCGGG
CTCGTGTCGG TCCCGCTGCT GACGGGCGCG TTGAAGGACC GGCTGCACGC GGATGCCCGG
CAAATGGCGG AGTCGCTGGC CGCGACATGG CAGGTGGCGG GGCTCGTCGA TCTGCACGCC
CAGATCGCGG CGAATGTGGC CACCACGCGC GACTTCGCGA ACCTCTATCT CTTCATCGAC
AATTCCGGCC GGATCGTCTT CGGCAACTTC CACCTGCCCG AGCCCTTCGT CGGGTCGCGC
GAGCTCGTCT CCGGGCGCGA CATGCTGCTG CCGGGCGCGG CGCCCGACGG GATCGGCTTC
TCGGCCTACG GCCTGCGCAT CCCGGCCGGC TATGTGATCG CGGCCCGCCA GACGAGCGCG
CTCGACGAGG TGCGGGCGAT CGTGATCCGG GCGGTGGCCT CGGGGCTGGC GCTGGCGCTT
CTGCTGGCAC TGGGGGTGGT GGGGATCCTC GCCTGGGGGG CGGAACGGCG GATCGCGCGG
CTCTCGAGCG TGCTCACGCG GGCGTCGGCC GGCGATCTGT CCGAGCGGGT GCAGGATGCG
GGCAGCGACG ACATCGGCCG TATCGCCGGC GCCGTGAATG CGACGCTCGA CCAGCTGGAG
CTGACGGTCG AGAGCCTCCG GCAGGTCTCC TCCGACGTGG CGCACGATCT GAGGACGCCG
ATCACGCGCC TGCGCACCTC GCTCGAACCC TTGATGCTGC GCGAGGATCT GCCCGAGGAT
GCCGCGGCGG ACATCGCCTC GGCGGTCGGT CAGGCGGACC GGATCGTGCG GATCTTCAAC
GCCGTGCTGC GCATCGCCCA GATTGAGGGC GGCGGCGCCC GCCCGCGCTT CGCGAGGCTC
GATCTGGGGC GGCTTGCCGA CGATCTCCAC GAACTGCTGG AGCCGGTGGC CGAGGAGATG
GGTCACCGCA TCACCGCCCG GATCGAGCCC GTGGCGGTCG AGGGCGACCG CGATCTGCTC
GCGCAGGCCA TCTCGAACCT TGTCGAGAAT GCCTTCCGCC ACTGCCCGCC GCCCGCCCAT
GTCGCGCTGA CGGTGAGGCG CGAGGGGGCG GAGGCGGTGG TGACGGTCGA GGATGACGGG
CCGGGCATTC CCGCGGCCGA GCGCGAGGCG GTCTTCCGCC GCTTCTACCG GCTCGAGCGG
AGCCGCAACA GCGAGGGAAG CGGCTTGGGC CTCAGCCTCG TCGCCGCGGT GGCGCGCCTT
CACGGCGGCC GCGTCGAGCT GCACGACGCG GAGCCCGGCC TGCGCGTTAC GCTGCGGCTG
CCGGTCGGAG AGTGA
 
Protein sequence
MSLRASPDPQ AGISRLSGSS SLRLAARLAL IFVGATLLAG LVSVPLLTGA LKDRLHADAR 
QMAESLAATW QVAGLVDLHA QIAANVATTR DFANLYLFID NSGRIVFGNF HLPEPFVGSR
ELVSGRDMLL PGAAPDGIGF SAYGLRIPAG YVIAARQTSA LDEVRAIVIR AVASGLALAL
LLALGVVGIL AWGAERRIAR LSSVLTRASA GDLSERVQDA GSDDIGRIAG AVNATLDQLE
LTVESLRQVS SDVAHDLRTP ITRLRTSLEP LMLREDLPED AAADIASAVG QADRIVRIFN
AVLRIAQIEG GGARPRFARL DLGRLADDLH ELLEPVAEEM GHRITARIEP VAVEGDRDLL
AQAISNLVEN AFRHCPPPAH VALTVRREGA EAVVTVEDDG PGIPAAEREA VFRRFYRLER
SRNSEGSGLG LSLVAAVARL HGGRVELHDA EPGLRVTLRL PVGE