Gene RPC_3803 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_3803 
Symbol 
ID3969223 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp4228509 
End bp4231313 
Gene Length2805 bp 
Protein Length934 aa 
Translation table11 
GC content65% 
IMG OID637926913 
ProductCheA signal transduction histidine kinases 
Protein accessionYP_533656 
Protein GI90425286 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0643] Chemotaxis protein histidine kinase and related kinases
[COG2204] Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGACT TGTTGCGCGA GTTTTTGACG GAGACGAATG AGAGCCTGGA CACGGTTGAC 
AACCAACTGG TCAGGTTCGA GCAGGACCCC AACAACGCCA AGATTTTGGA CAACATTTTC
CGGCTGGTCC ACACCATCAA GGGGACCTGC GGGTTTTTGG GTCTGCCGCG GTTGGAAGCT
TTGGCGCACG CCGCCGAGAC CCTGATGGGC AAATTCCGCG ACGGCATGCC GGTGACCGGC
GAGGCGGTGA CGCTGATCCT TCTGACCATC GACCGGATTA AAGAGATTCT AGGCGGGCTG
GAGGCCACCG AGGCCGAGCC CGAGGGCGTC GACCAGGATT TGATCGGCGA GTTGGAAGTG
TTGTCGCAAG CGCCGATGGC GCCGACGCTC GCGGCGGTGC CCGAGATGGC GGCGCCGGAG
GTGGTGCCTG AGGTCGTCGC CGCCCCCGAA GCGGCGGTCG CCGAGGGCAC GCTGGTGCCG
CAGATTCTGG AGCGCGCGCT ACGCCCCGGC GAAGTCTCGC TCGACGAATT GGAGCGGGCG
TTTCGCGAGA CCGAAGTCGC GGTGGAAATT GCGCAGCCGG TCGAAGCCAA AGCTGCCGAA
GCCGACCATG CCGAGCATAA GCCCGCCGAC GCCGCGCCGG TGGCCGCCGA GGCCAAATCC
GACGCCAAAT CCGACGCCAA GTCGGACGCC AAGCCAGGCA AGCCTGTGAC CAAGAAGAAG
ACCGCCGTGG AATTGGATAT GCCGCTGCAC GATTCCGACA AGATCGCCAA CCAGTCGATC
CGGGTCAATG TCGACACCCT CGAGCACCTG ATGACCATGG TGTCCGAGCT GGTCTTGACC
CGCAATCAGC TCCTGGAGAT CAGCCGCCGC CACGAGGACA CCGAGTTCAA GGTGCCGTTG
CAGCGGCTCT CCACCGTCAC CGCCGAACTG CAGGACGGCG TGATGAAGAC AAGGATGCAG
CCGATCGGCA ACGCCTGGCA GAAGCTGCCG CGCATCGTGC GTGATCTGGC CTCCGAACTC
GGCAAGCAGA TCGAACTGGA GATGCACGGC GCCGACACCG AGCTCGACCG CCAGGTGCTC
GACCTGATCA AGGATCCGTT GACGCATATG GTGCGCAACT CCGCCGATCA CGGCCTCGAG
ACCCCGGCCG ATCGCGCCGC CGCCGGCAAG CCCGAGCAGG GCACGATTCG CTTGTCCGCC
TATCACGAGG GCGGCCACAT CGTGCTGTCG ATCGCCGACA ATGGTCGCGG CCTCGACACC
GCGCGGATCA AGGCCAAGGT GATCGCCAAC GGGCTGGCCT CCGAAGCCGA CGTCGAGAAG
ATGTCGGAGA GCCAGATCCA CAAATACATC TTCGCGCCGG GGTTCTCCAC CGCGGCTGCG
GTCACCAGCG TGTCCGGCCG CGGCGTCGGC ATGGACGTGG TGCGCACCAA TATCGATCAG
ATCGGCGGCA CCATCGACAT CAAGTCGGTG GCCGGCGAAG GCTCCAGCGT CACTATCAAG
ATCCCGCTGA CCTTGGCGAT CGTCTCGGCG CTGATTGTTG AGTCCGCCGG CAACCGCTTC
GCCATCCCGC AATTGGCGGT GGTCGAGCTG GTGCGGGCGC ACGCCAATTC CGAGCACAAG
ATCGAGCGCA TCAAGGACAC GCCCGTCCTG CGCTTGCGCA ACAAGCTGCT GCCGTTGATG
CATCTGCGCC ACCTGTTGCG GATCGACGAC GGCAAGGTCA CCGAGCCGGA GAACGGCTTC
ATCGTGGTGA CCCAGGTCGG CAACCAGACC TTCGGCATCG TCGTCGACGG CGTGTTCCAC
ACCGAAGAAA TCGTCGTCAA GCCGATGTCG ACCAAGCTGC GGCACATCGG CATGTTCTCC
GGCAACACCA TCCTGGGCGA CGGCGCGGTG ATCATGATCG TCGATCCCAA CGGCATCGCG
CAGGCGCTCG GCACCTCGGT CGCAGCCCAG CACGACATCG CCGAAGACAA CGCCGCGATC
CGCGCCTCCT CGGCCGATCA GTTGACCTCG CTGCTGGTGT TCCGCGCCGG TTCGGCGCAG
CCCAAGGCGG TGCCGCTGTC GCTGGTGACG CGGCTCGAAG AAATCGCCGC CGACAAGATC
GAGTTCTCCA ACGGCCGCCA CATGGTGCAG TACCGCGATC AGCTGATGCC GCTGGTGACG
ATGGACGGCG TCAGCGTCAA GACCTCCGGG GCGCAGCCGA TCCTGGTGTT CGCCGACGAG
GACCGCGCGA TGGGCCTCGT GGTCGACGAG ATCGTCGACA TCGTCGAAGA GCATCTGCAG
ATCGAAGTCG GCTCCAGCCA CGAGGGCATT CTCGGCTCCG CGGTGATCAA GGGCGCCGCC
ACCGAAGTGA TCGACGTCGG CCACTTCCTG CCGATGGCGT TCGCCGACTG GTTCAAGCGC
AAGGAAATGC GGCCCTCCAC CACCGCGCAG TCGATCCTGC TGGTCGATGA CTCGGCGTTC
TTCCGCAACA TGCTGGGCCC GGTGCTGAAG GCCGCGGGCT ACAAGGTGCG GCTCGCCACC
AACGCCCAGG AAGGCTTGGG CGTGCTGCGC TCCGGCCGGG AGTTCGACGC CATCCTCACC
GACATCGAGA TGCCGGACAT GAACGGCTTC GAATTCGCCG AGACCATCCG CGCCGACGCC
AAATTGGCGC AGACCCCGAT CATCGCGCTG AGCTCAATGA TCTCGCCGGC GGCGATCGAG
CGTGGCCGCC AGGCCGGCTT CCACGACTAC GTCGCCAAGT TCGACCGGCC GGGCCTGATC
GCCGCGCTGA AGGAACAGAC CACCAACATG AACCAGGCGG CGTGA
 
Protein sequence
MDDLLREFLT ETNESLDTVD NQLVRFEQDP NNAKILDNIF RLVHTIKGTC GFLGLPRLEA 
LAHAAETLMG KFRDGMPVTG EAVTLILLTI DRIKEILGGL EATEAEPEGV DQDLIGELEV
LSQAPMAPTL AAVPEMAAPE VVPEVVAAPE AAVAEGTLVP QILERALRPG EVSLDELERA
FRETEVAVEI AQPVEAKAAE ADHAEHKPAD AAPVAAEAKS DAKSDAKSDA KPGKPVTKKK
TAVELDMPLH DSDKIANQSI RVNVDTLEHL MTMVSELVLT RNQLLEISRR HEDTEFKVPL
QRLSTVTAEL QDGVMKTRMQ PIGNAWQKLP RIVRDLASEL GKQIELEMHG ADTELDRQVL
DLIKDPLTHM VRNSADHGLE TPADRAAAGK PEQGTIRLSA YHEGGHIVLS IADNGRGLDT
ARIKAKVIAN GLASEADVEK MSESQIHKYI FAPGFSTAAA VTSVSGRGVG MDVVRTNIDQ
IGGTIDIKSV AGEGSSVTIK IPLTLAIVSA LIVESAGNRF AIPQLAVVEL VRAHANSEHK
IERIKDTPVL RLRNKLLPLM HLRHLLRIDD GKVTEPENGF IVVTQVGNQT FGIVVDGVFH
TEEIVVKPMS TKLRHIGMFS GNTILGDGAV IMIVDPNGIA QALGTSVAAQ HDIAEDNAAI
RASSADQLTS LLVFRAGSAQ PKAVPLSLVT RLEEIAADKI EFSNGRHMVQ YRDQLMPLVT
MDGVSVKTSG AQPILVFADE DRAMGLVVDE IVDIVEEHLQ IEVGSSHEGI LGSAVIKGAA
TEVIDVGHFL PMAFADWFKR KEMRPSTTAQ SILLVDDSAF FRNMLGPVLK AAGYKVRLAT
NAQEGLGVLR SGREFDAILT DIEMPDMNGF EFAETIRADA KLAQTPIIAL SSMISPAAIE
RGRQAGFHDY VAKFDRPGLI AALKEQTTNM NQAA