Gene RPC_4100 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_4100 
Symbol 
ID3973189 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp4556373 
End bp4558052 
Gene Length1680 bp 
Protein Length559 aa 
Translation table11 
GC content64% 
IMG OID637927204 
Productmethyl-accepting chemotaxis sensory transducer with Pas/Pac sensor 
Protein accessionYP_533945 
Protein GI90425575 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0840] Methyl-accepting chemotaxis protein 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCTTCT GGCAACGCAA TTCCGACGCC GATAGTGCTG CGGCGCAACT CGAGGCGATC 
GGGCGATCGC AGGGCGTCAT CGAGTTCGGC ATGGACGGCA GCATCCTCAC GGCCAACCGG
AATTTTCTCG ACGTGCTGGG CTACACGCTG GCGGAAATTC AGGGCAAGAA TCACAGCCTG
TTCGTCGATC CGGCCGAGCG CGGCAGCGCC GCGTATCGCG GGTTCTGGGC GGCGTTGCAG
CGCGGCGAAT TCCAGGCGGC GGAGTACAAG CGGATCGGCA AGGCCGGCAG GGAAGTCTGG
ATTCAGGCCT CCTACAATCC GATCCTCGGC CGCGACGGCA AGCCGTTTCG AATCGTCAAA
TTCGCCACCG ACATCACCGC GCGGAAACTC CGCAGCATGG AAGACGCCGG CAAGATCGCC
GCGATCCAGC GCGCGCAAGC CGTGGTCGAA TTCAAGCTCG ACGGCACCAT CATCACCGCC
AACGACAATT TCTTGAAGGC GATGGGCTAC ACGCTCGACG AAATCGCCGG CAAAAATCAC
AGCATCTTCG TCGAGTCTGC GACCCGCGAC GGCGCGGCGT ATCGCGAGTT TTGGGCGGCG
CTGAACCGCG GCGAATATCA GGCCGCCGAA TACAAGCGGA TCGGCAAGGC CGGCCGCGAG
GTCTGGATTC TTGCGACCTA CAATCCGATC ATCGACGAGA CCGGCAAGCC GATCAAAGTC
GTCAAATTCG CCACCGACGT CACCGCGCAA AGGCTCGGCA CCGCCAATCT CGCCGGTCAA
GTCGAAGCGA TCGGCAAGTC GCAAGCGGTG ATCGAATTCG GCATGGACGG CACTATCCTC
ACGGCCAACG ACAATTTTCT GAACGCGATG GGCTACACGC TGTCCGAGAT CAAGGGCCGC
CACCACAGCA TCTTCGTGGA GCCGGCCGAG CGCGACAGCC CGGCCTACGG CGCGTTCTGG
GCCGGGCTCA ACCGCGGCGA ATTCCAAGCC GCCGAATACA AGCGGATCGG CAAGCACGGC
CGCGAGGTCT GGATCCAGGC GTCCTACAAT CCGATCCTCG ATCTCAACGG CAAGCCGTTC
AAGGTCGTCA AATACGCCAC CGACACCACC GCGCAGGTGA TCGCCCGGCT GAAAAGCGAC
CGGGTCCGCG CCATGATGGA GTCGGTCGCC GCGGGGGCCG AGGAATTGAA CGCCTCGGTG
CGCGAGATTT CCGAGGCGAT GACCAAATCG CGCAACACCG CGATGTCGGC CGTCAGCAAG
GTGGAATCGG CGGACGAACA GGCGCAGCGC CTCAGCGTCG CCGCGCAGGC GATGGGCGGC
ATCGTCGACC TGATCGGCAA CATCACCGGT CAGATCAATC TGTTGGCGCT CAACGCCACG
ATCGAATCCG CGCGCGCCGG CGAAGCCGGC CGCGGCTTCG CGGTGGTGGC GTCCGAGGTG
AAGAGCCTCG CCAACCAGGC CAAGCAAGCC ACCGACAGGA TCGGTAGCGA GATCGCCGGC
CTGAACGGCA TTTCCGGCGA TGTGATCAGC GCCCTGCTTG CGATCAAGAC GGAAATCCAG
AACGTCAGCG AATACGTCAC CGCCACCGCG GCGGCGGTCG AGGAGCAGAG CACGGTGACC
AGCGAGATGT CGTCCAGCAT GCAGCGCGCC GCCGCCGAAG CCGCCAGCAT CGGGCGCTAA
 
Protein sequence
MSFWQRNSDA DSAAAQLEAI GRSQGVIEFG MDGSILTANR NFLDVLGYTL AEIQGKNHSL 
FVDPAERGSA AYRGFWAALQ RGEFQAAEYK RIGKAGREVW IQASYNPILG RDGKPFRIVK
FATDITARKL RSMEDAGKIA AIQRAQAVVE FKLDGTIITA NDNFLKAMGY TLDEIAGKNH
SIFVESATRD GAAYREFWAA LNRGEYQAAE YKRIGKAGRE VWILATYNPI IDETGKPIKV
VKFATDVTAQ RLGTANLAGQ VEAIGKSQAV IEFGMDGTIL TANDNFLNAM GYTLSEIKGR
HHSIFVEPAE RDSPAYGAFW AGLNRGEFQA AEYKRIGKHG REVWIQASYN PILDLNGKPF
KVVKYATDTT AQVIARLKSD RVRAMMESVA AGAEELNASV REISEAMTKS RNTAMSAVSK
VESADEQAQR LSVAAQAMGG IVDLIGNITG QINLLALNAT IESARAGEAG RGFAVVASEV
KSLANQAKQA TDRIGSEIAG LNGISGDVIS ALLAIKTEIQ NVSEYVTATA AAVEEQSTVT
SEMSSSMQRA AAEAASIGR