Gene Rpal_1872 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_1872 
Symbol 
ID6409531 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp2012593 
End bp2015406 
Gene Length2814 bp 
Protein Length937 aa 
Translation table11 
GC content66% 
IMG OID642711760 
ProductCheA signal transduction histidine kinase 
Protein accessionYP_001990873 
Protein GI192290268 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0643] Chemotaxis protein histidine kinase and related kinases
[COG2204] Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.134352 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACGATC TTCTTCGTGA GTTTTTGACG GAGACCTTCG AGAGCCTGGA CACGGTTGAC 
AACCAGTTGG TCCGGTTTGA GCAGGAGCCG AACAACGCGA AGATATTGGA CAATATTTTT
CGTCTTGTTC ACACCATCAA GGGAACGTGC GGGTTTCTAG GGTTGCCGCG GCTTGAAGCG
CTTGCGCACG CGGCCGAGAC CCTGATGGGC AAATTCCGGG ACGGAATGCC GGTGACGGGG
GAGGCGGTGA CGCTGATCCT GACCACGATC GACCGGATCA AGGACATTCT GACCCAGCTG
GAGGCGACCC AGGCCGAGCC CGAGGGCGAG GACGGCGACC TGATCGGGGA GCTGGAGCGG
CTGTCGATGC GCTCGCCGGA AGAGATCGCG GCCGAGCTCG GCGGCGCTGC GCCGGTGGAG
GTTGCCGAAG TCGAAGCCCC TGCCGAAGCT GTTGTGGCCG AGACGGCCGA CGCCAATTCG
ACCGAAGGCA CCCTGGTGGC GCAGACGCTG GAGCGTCCGC TGCGGCCGGG TGAAGTGTCG
CTGGACGAGC TTGAGCGCGC CTTCCGCGAG ACCGAGATCG AGATGGCCTC GCCGCCGCTG
CAGCCCGCCG TGAGCGAAGC TCCGGCTGCT GTGGCTGAAG CCGCGCCGCC TGAACCGAAG
CCGGCCAAGC CCGCCAAACC GGCCGCCAAG CCGGCGGCGA AGAAGTCCGG CGGCGAAGGC
GAGGGCGCAG CTGAAGGTGG GGCCGCTGGC GGCGTCGCCA ACCAGTCGAT CCGCGTCAAC
GTCGATACCC TCGAACACCT GATGACGATG GTGTCGGAGC TGGTGCTGAC CCGTAACCAG
CTGCTCGAGA TCAGCCGCCG CCACGAGGAC AACGAGTTCA AGGTGCCGCT GCAGCGGCTC
TCCACCGTCA CCGCCGAGCT GCAGGACGGG GTGATGAAGA CCCGGATGCA GCCGATCGGC
AACGCCTGGC AGAAGCTGCC GCGGATCGTG CGCGATCTGG CCGCCGAACT CGGCAAGCAG
ATCGAGCTGG AGATGCACGG TGCCGACACC GAGCTCGACC GCCAGGTGCT CGACCTGATC
AAGGATCCGC TCACCCATAT GGTGCGCAAC TCCGCCGACC ACGGGCTGGA GAAGCCCGAG
GACCGGGCGC GCGCCGGCAA GCCCGAGCAG GGCACCATCC GCCTGTCCGC CTATCACGAG
GGCGGCCACA TCGTGATCTG CATCGCCGAC AACGGCCGCG GGCTGGACAC CGAACGGATC
AAGGCCAAGG CCTTGGCCAA CGGGCTGGTC ACCGAGGCCG AACTCGAGAA GATGACCGAG
GCGCAGATCC ACAAGTTCAT CTTCGCGCCG GGCTTCTCGA CCGCCGCCGC CGTCACCTCG
GTGTCCGGCC GCGGCGTCGG CATGGACGTG GTGCGCACCA ATATCGACCA GATCGGCGGC
ACGATTGAAG TGAAGTCGGT CGCGGGCGAA GGCTCGGCCA TCACCATCAA GATCCCGCTC
ACCCTGGCGA TCGTCTCGGC GCTGATTGTC GAAGCCGGCG GCGACCGGTT CGCGATCCCG
CAGCTCGCGG TGGTCGAGCT GGTGCGGGCA CGGGCCAACT CCGAGCACCG CATCGAGCGG
ATCAAGGATA CGCCGGTCCT CAGACTGCGC GACAAGCTGC TGCCGCTGAT CCACCTGAAG
AAGCTGCTCG GCATCGACGA GGGCGCCAAC AGCGAGCCGG AGAACGGCTT CATCGTGGTG
ACCCAGGTCG GCAGCCAGAC CTTCGGCATC GTGGTCGACG GCGTGTTCCA CACCGAAGAA
ATCGTCGTCA AGCCGATGTC GACCAAGCTG CGTCACATCG GAATGTTCTC GGGCAACACC
ATCCTGGGCG ACGGCGCGGT GATCATGATC GTCGATCCGA ACGGGATCGC GCAGGCGCTC
GGCACCGCGG TGTCGGCGCA GCACGATATC TCCGACCAGG CGGCGGCGAG CCGCAACGCC
TCGGCCGAAC AGCTCACCTC GCTGCTGGTG TTCCGCGCCG GCTCGAGCCA GCCGAAGGCG
GTGCCGCTGT CGCTGGTGAC GCGCCTGGAA GAGATCGCCT CCGACAAGAT CGAGATGTCG
AACGGCCGCT ACATGGTGCA GTACCGCGAC CAGCTGATGC CGCTCGTGCT GATGGAAGGC
GTCGAGGTCG CCACCAGCGG CGTGCAGCCG ATCCTGGTGT TCGCCGACGA GGACCGGTCG
ATGGGCCTTG TGGTCGACGA GATCGTCGAC ATCGTCGAGG AGCATCTGCA CATCCAGGTC
GGCTCCAGCC GCGAGGGCAT TCTCGGCTCT GCGGTGATCA AGGGCCAGGC CACCGAGGTG
ATCGACGTCG CGCACTTCCT GCCGATGGCG TTCTCCGACT GGCTGGCGCG CAAGGAGATG
AAGCAGTCGC TGACCACCCG CTCGGTGCTG CTGGTCGATG ACTCGGCGTT CTTCCGCAAC
ATGCTGGGTC CGGTGCTGAA GGCGGCGGGC TACAAGGTGC GGGTCGCGAC CTCGGCGGTC
GAGGGCCTGT CGGTGCTGCG CTCGGGTGCG CAGTTCGACG TGATCCTGAC CGACATCGAG
ATGCCGGAGA TGAACGGCTT CGAGTTCGCC GAGGCGATCC GCTCCGACAC GAAGATGTCG
AACCTGCCGG TGATCGCGCT GAGTTCGCTG GTGTCGCCGG CGGCGATCGA GCGCGGCCGC
CAGGCCGGTC TGACCGACTA CATCGCCAAG TTCGATCGGC CCGGCCTGAT CGCTGCGCTG
AAGGAGCAGA CCACGATGCA TGCGACGCCC GAAGTGCTGG AGCAGGCGGC ATGA
 
Protein sequence
MDDLLREFLT ETFESLDTVD NQLVRFEQEP NNAKILDNIF RLVHTIKGTC GFLGLPRLEA 
LAHAAETLMG KFRDGMPVTG EAVTLILTTI DRIKDILTQL EATQAEPEGE DGDLIGELER
LSMRSPEEIA AELGGAAPVE VAEVEAPAEA VVAETADANS TEGTLVAQTL ERPLRPGEVS
LDELERAFRE TEIEMASPPL QPAVSEAPAA VAEAAPPEPK PAKPAKPAAK PAAKKSGGEG
EGAAEGGAAG GVANQSIRVN VDTLEHLMTM VSELVLTRNQ LLEISRRHED NEFKVPLQRL
STVTAELQDG VMKTRMQPIG NAWQKLPRIV RDLAAELGKQ IELEMHGADT ELDRQVLDLI
KDPLTHMVRN SADHGLEKPE DRARAGKPEQ GTIRLSAYHE GGHIVICIAD NGRGLDTERI
KAKALANGLV TEAELEKMTE AQIHKFIFAP GFSTAAAVTS VSGRGVGMDV VRTNIDQIGG
TIEVKSVAGE GSAITIKIPL TLAIVSALIV EAGGDRFAIP QLAVVELVRA RANSEHRIER
IKDTPVLRLR DKLLPLIHLK KLLGIDEGAN SEPENGFIVV TQVGSQTFGI VVDGVFHTEE
IVVKPMSTKL RHIGMFSGNT ILGDGAVIMI VDPNGIAQAL GTAVSAQHDI SDQAAASRNA
SAEQLTSLLV FRAGSSQPKA VPLSLVTRLE EIASDKIEMS NGRYMVQYRD QLMPLVLMEG
VEVATSGVQP ILVFADEDRS MGLVVDEIVD IVEEHLHIQV GSSREGILGS AVIKGQATEV
IDVAHFLPMA FSDWLARKEM KQSLTTRSVL LVDDSAFFRN MLGPVLKAAG YKVRVATSAV
EGLSVLRSGA QFDVILTDIE MPEMNGFEFA EAIRSDTKMS NLPVIALSSL VSPAAIERGR
QAGLTDYIAK FDRPGLIAAL KEQTTMHATP EVLEQAA