Gene RPC_4099 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_4099 
Symbol 
ID3973188 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp4554470 
End bp4556137 
Gene Length1668 bp 
Protein Length555 aa 
Translation table11 
GC content65% 
IMG OID637927203 
Productmethyl-accepting chemotaxis sensory transducer with Pas/Pac sensor 
Protein accessionYP_533944 
Protein GI90425574 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0840] Methyl-accepting chemotaxis protein 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCATTTT GGACGCGCAG CTCCGACGCG GAGAGTGCTG CACAGATGCA AGCCATCAGC 
CGGTCGCAGG GCGTGATCGA GTTCGCCATT GACGGCACCA TTCTCACCGC CAACCAGAAT
TTTCTCGACG CGCTCGGCTA CAGCCTGGCC GAGATCAAGG GCAAGCATCA CCGGATGTTC
GTCGATCCCG ACGAGCGCGA CGGCGCCGCC TACCGCGCGT TCTGGGCGAG CCTTGGCCGC
GGCGAATACC AGGCCGCCGA ATACAAGCGG ATCGGCAAGG GCGACCGCGA GGTCTGGATC
CAGGCCAGCT ACAATCCGAT CATGGATCGC AGCGGCAAGC CGGCCAAAGT CATCAAGTTC
GCCACCGACG TCACCGCGCG CAAGATCCGC GGCATGGAAG ACGCCGGCAA GATCGCCGCG
ATCCTGCGGG CGCAGGCGGT GATCGAGTTC AATCTCGACG GCACCATCAT CACCGCCAAC
GACAATTTCC TCGGCGTGAT GGGCTACGCG TTGGCCGATG TCGTCGGCAA GCCGCACAGC
ATGTTCGTCG AGCCGGCGGA GCGCGACAGC GCCGCGTATC GTGCGTTCTG GGCCGGGCTC
AATCGCGGCG AATATCTCGC CGCGGAGTTC AAGCGGATCG GCAAGGGTGG CCGCGAGGTC
TGGATCCTGG CGTCCTACAA CCCGATCATC GACGAGAAGG GCAAGGTCTT CAAGGTGGTC
AAATTCGCCA CCGAGGTGAC CCAGCAAAAG CTGCGCTTCA CCGAACTCGA CGGCCAGGTG
CAAGCGATCG GCAAGTCGCA GGCGGTGATC GAATTCGCCA TGGACGGCAC CATTCTCGGC
GCCAACGGCA ATTTTCTCGA CGCGATGGGC TATTCGCTGG CCGAGATCAA GGGCCGTCAC
CACAGCATGT TCGTCGACCC CTCGGAGCGC GACGGCGCCG CCTACCGGGC GTTCTGGGCC
ACGCTCAATC GCGGCGAGTT CCAGGCCGCC GAATACAAGC GGATCGGCAA GGGCGGCCGC
GAGGTCTGGA TCCAGGCCAC CTACAATCCG ATTCTCGATC TCAACGGCAA GCCGTTCAAG
ATCGTCAAAT ACGCCAGCGA CACCACCGCC CGGGTGATCG CACGGATCAA GAGCGACCGG
GTCAGCAGGA TGCTGGAGTC GGTGGCGGAC GGCGCCGAGA AGTTGAACGC CTCGGTGCGC
GACATCGCCG AGGCGATGAC CAAATCGAAG TCCACCGCGA TCGCGGCGGT CACCAAGGTG
GAGTCGGCCG ATCAGCAGAC CCAAAGGCTG GCCGTGGCGG CGCAGTCGAT GGGCGGCATC
GTCGAACTGA TCGGCAACAT CACCGGCCAG ATCAACCTAT TGGCGCTCAA CGCCACGATC
GAATCGGCGC GCGCCGGCGA GGCCGGCCGC GGCTTTGCGG TGGTCGCCTC CGAGGTGAAG
AATCTCGCCA ACCAGGCCAC CGACAAGATC GGCAGCGAAA TCTCCGGGCT GAACGGCATC
TCCGCCGACG TGGTCGCGGC GTTGCTGGCG ATCAAGCAGG AAATCCAGAG CGTCAGCGAA
TTCGTCACCG CCACCGCCGC CGCGGTCGAG CAGCAGAGCG AGGTGACGGC CGAGATGTCA
TCGAGCATGC AGCAGGCCGC CGCCGAGGCC GCCAGCATCG GGCATTAG
 
Protein sequence
MAFWTRSSDA ESAAQMQAIS RSQGVIEFAI DGTILTANQN FLDALGYSLA EIKGKHHRMF 
VDPDERDGAA YRAFWASLGR GEYQAAEYKR IGKGDREVWI QASYNPIMDR SGKPAKVIKF
ATDVTARKIR GMEDAGKIAA ILRAQAVIEF NLDGTIITAN DNFLGVMGYA LADVVGKPHS
MFVEPAERDS AAYRAFWAGL NRGEYLAAEF KRIGKGGREV WILASYNPII DEKGKVFKVV
KFATEVTQQK LRFTELDGQV QAIGKSQAVI EFAMDGTILG ANGNFLDAMG YSLAEIKGRH
HSMFVDPSER DGAAYRAFWA TLNRGEFQAA EYKRIGKGGR EVWIQATYNP ILDLNGKPFK
IVKYASDTTA RVIARIKSDR VSRMLESVAD GAEKLNASVR DIAEAMTKSK STAIAAVTKV
ESADQQTQRL AVAAQSMGGI VELIGNITGQ INLLALNATI ESARAGEAGR GFAVVASEVK
NLANQATDKI GSEISGLNGI SADVVAALLA IKQEIQSVSE FVTATAAAVE QQSEVTAEMS
SSMQQAAAEA ASIGH