Gene Rru_A0521 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRru_A0521 
Symbol 
ID3834489 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodospirillum rubrum ATCC 11170 
KingdomBacteria 
Replicon accessionNC_007643 
Strand
Start bp614264 
End bp617092 
Gene Length2829 bp 
Protein Length942 aa 
Translation table11 
GC content64% 
IMG OID637824605 
ProductCheA Signal transduction histidine Kinases (STHK) 
Protein accessionYP_425612 
Protein GI83591860 
COG category[K] Transcription
[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0643] Chemotaxis protein histidine kinase and related kinases
[COG0745] Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.58429 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACGATC TGCTCAGCGA GTTCCTCACG GAAACCTCGG AAAGCATTGC CACGCTTGAC 
GTGGAACTCG TGAATTTGGA GCGGAATCCA AACGACAAGG GTATCCTTTC GAACATTTTC
CGTCTGGTCC ACACCATCAA GGGAACCTGT GGTTTCCTTG GGCTGCCCCG CCTTGAAAGC
TTGGCCCATG CCGGCGAGAA CGTGCTAGGC AAGTTCCGTG ACGGCGATCT TGAAGTGACC
CCGGCGGCGG TGACGGTGAT TTTGCACACC ATCGACCGCA TCAAGGAAAT CCTGGGTCAT
CTGGAAGCCA ATGAGGTCGA GCCCGAGGGC GACGACAAGG ATTTGAAAGC CCATCTGAAC
GCCTTCGCCG AGGGCAAGCA GCCGAGCGTG GCGGCGATGG CCCCGGCCCC GGCCCGCGCC
GCGGGCTCGG GTCCGGCGAT TTCGGAAGGC GGCTATCCGG TCGCCGCCGA GCTGCTGGCC
GAGGTGAGCG AGGCCGTCGC CAAGGGCAAG CGCGCCGCGA CCGAGGCCGA GTTGGCCGCC
GAGCTGGCGG CCGAACTGGC GGCGGCGGAG AAAGCCGAAC AGGCGGTCGC CGCGCCCGTT
CCCGAAGTCA TTCCCGAGGT CAAGGCCGCG ACTCCGGTGG TCCAGCCGGC CAAGCCGCCG
GCGGTGACCG CCGCCCATGA CACCAACGGC CCCAACGGCG GTGGTGGCGG CGAGCAGAAG
GAAGGCTCGG TCGCCTCGCA GTCGATCCGC GTCAATGTGG AACTGCTTGA AAACCTGATG
ACCCTGGTGT CTGAACTGGT GCTGACGCGC AACCAGTTGC TCCAGATGGT ACGCGGCAGC
GACGATTCGG AATTCGTCGC GCCGCTCCAG CGGCTTTCCC ATATCACCAC CGACCTTCAG
GAAGGGGTGA TGAAAACCCG CATGCAGCCG ATCGGCAACG CCTGGGCCAA GCTGCCGCGC
ATCGTGCGCG ATCTGTCGAT CGAAATGCAC AAGAAGATCG ATCTGCAGAT GTACGGGGCC
GATACGGAAC TTGACCGTCA GGTTCTCGAG ATGATCAAGG ACCCGCTGAC CCACATGGTG
CGCAATTCCG GCGATCACGG CCTGGAATTC CCCGACGAGC GGGCGGCGGC GGGCAAGCCC
GAGACCGGCG TCATCAAGCT CAACGCCTAT CACGAGGGCG GCCACATCAT CATCGAGATT
AGCGATGATG GGCGCGGCCT CAATCTTGAA CGCATCCGCG CCAAGGCGCT CTCCAACGGC
CTCGCCACCG AGGCCGAACT GGAGAATATG ACCGATCAGC AGATCGCCCA GTACATCTTC
CGCGCCGGGC TTTCCACCGC CGAAAAGGTC ACGGCCGTAT CGGGCCGCGG CGTCGGCATG
GACGTGGTCA AGACCAATAT CGAAAAGATC GGCGGCACGG TCGAGCTGAA GACTTGGCCG
GGCAAGGGCT CGCGCTTCGT CATCAAGATT CCGCTGACCC TGGCCATCGT CTCCGCCCTG
ATCGTCGAGG CCTCGGGCGA GCGCTTCGCC ATCCCGCAGA TCTCGGTGCT TGAACTGGTG
CGGGTGACGG CCAATTCGGA AACCACCATC GAGCAGATCA ACACCGCGCC GGTCTTGCGC
CTGCGCGATC GCCTGATGCC GCTGGTGTCG CTTTCGGCCC TGTTGCGCCT GGATGACGGC
GATGACGAGG AACTGGGCAA GACCGCCGAC GCCACGGCCG GACGGCGCGA TGAAACCTTC
ATCGTCGTCA GTCAGGTTGG CACCTATACC TTCGGCATCA TCGTCGACCG CGTCTTCGAC
ACCGAGGAAA TCGTCGTCAA GCCGGTCGCT CCGATCCTGC GCCATGTGTC GATGTTCTCA
GGCAACACCA TCCTGGGCGA CGGCAGCGTG ATCATGATCC TTGATCCCAA CGGCATCGCC
AGCGCCACCG GCGAGGTGAC CATGGGGTCG GCGTCGGGGA CCACCGAAGC CGCCCAGTCC
CACGAGTTCG TCGGCGAGGA TCGCACCTCG CTGCTGGTCT TCCGCGCCGG GGGCAAGGAT
CTCAAGGCCG TGCCGCTGGC CCTGGTCGCC CGCCTCGAGG AAATCGAGAC CGACAAGATC
GAGCATTCCT TCGGCAAGCC GGTGGTTCAG TACCGGGGCC AGTTGATGCC GCTGGTCGGC
ATCCACGATG AATTCACCCT TGCCGGCGAA GGCCGCCAGC CGGTTCTGGT CTTCTCGGAC
CGTGACCGCA CCATGGGTCT GGTCGTCGAT GAAATCGTTG ATATCGTCGA AGACCACTTG
AAGGTGGAAT TGCGCGCCGA TCTGCGCGGC GTCGTCGGAA CGGCGGTGGT CAACGGCAAG
GCCACCGATA TCATCGATAC CGGGTATTAT CTGACCAAGG CCTTCGGCGA TTGGTTCGGC
ACGATGAAGA GCGATGCCTT CGGCGAGCAA AAGAGCGCGA TCCGGGTTCT TCTGGTCGAC
GACAGCCCGT TCTTCCGCAA TCTGCTGACG CCGCTGCTGT CGGTTTCGGG CTATGCGGTG
ACCGCCGTCG AATCCGCTGA AAAGGCCCTG GAACTGCGTG AAAAGGGCCA TTCCTTCGAA
GCCATCATCA GCGATATCGA GATGGCCGGC ATGGATGGCT TCTCCTTCGC CGCCGCCATC
CGCGCCGATG GCCGGTGGGG CAATCTGCCG CTGATCGCCC TGTCGAGCCA CGCCACCGAG
CGCGATCTTC AGCGCGGCCG GGAAGCCGGC TTCGATGACT ATGTCGCCAA GTTCGACCGG
GACAGCCTGC TTGAGGTCCT CGGGCAACTG GTTGGCGGAC AGCCCGCTCT GGTGGCCCAG
GAGGGCTGA
 
Protein sequence
MDDLLSEFLT ETSESIATLD VELVNLERNP NDKGILSNIF RLVHTIKGTC GFLGLPRLES 
LAHAGENVLG KFRDGDLEVT PAAVTVILHT IDRIKEILGH LEANEVEPEG DDKDLKAHLN
AFAEGKQPSV AAMAPAPARA AGSGPAISEG GYPVAAELLA EVSEAVAKGK RAATEAELAA
ELAAELAAAE KAEQAVAAPV PEVIPEVKAA TPVVQPAKPP AVTAAHDTNG PNGGGGGEQK
EGSVASQSIR VNVELLENLM TLVSELVLTR NQLLQMVRGS DDSEFVAPLQ RLSHITTDLQ
EGVMKTRMQP IGNAWAKLPR IVRDLSIEMH KKIDLQMYGA DTELDRQVLE MIKDPLTHMV
RNSGDHGLEF PDERAAAGKP ETGVIKLNAY HEGGHIIIEI SDDGRGLNLE RIRAKALSNG
LATEAELENM TDQQIAQYIF RAGLSTAEKV TAVSGRGVGM DVVKTNIEKI GGTVELKTWP
GKGSRFVIKI PLTLAIVSAL IVEASGERFA IPQISVLELV RVTANSETTI EQINTAPVLR
LRDRLMPLVS LSALLRLDDG DDEELGKTAD ATAGRRDETF IVVSQVGTYT FGIIVDRVFD
TEEIVVKPVA PILRHVSMFS GNTILGDGSV IMILDPNGIA SATGEVTMGS ASGTTEAAQS
HEFVGEDRTS LLVFRAGGKD LKAVPLALVA RLEEIETDKI EHSFGKPVVQ YRGQLMPLVG
IHDEFTLAGE GRQPVLVFSD RDRTMGLVVD EIVDIVEDHL KVELRADLRG VVGTAVVNGK
ATDIIDTGYY LTKAFGDWFG TMKSDAFGEQ KSAIRVLLVD DSPFFRNLLT PLLSVSGYAV
TAVESAEKAL ELREKGHSFE AIISDIEMAG MDGFSFAAAI RADGRWGNLP LIALSSHATE
RDLQRGREAG FDDYVAKFDR DSLLEVLGQL VGGQPALVAQ EG