Gene Rru_A3039 details


Gene Information

Locus tag: Rru_A3039
Symbol:
ID: 3836485
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodospirillum rubrum ATCC 11170
Kingdom: Bacteria
Replicon accession: NC_007643
Strand:
Start bp: 3499877
End bp: 3503080
Gene Length: 3204 bp
Protein Length: 1067 aa
Translation table: 11
GC content: 68%
IMG OID: 637827154
Product: PAS/PAC sensor hybrid histidine kinase
Protein accession: YP_428121
Protein GI: 83594369
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID: [TIGR00229] PAS domain S-box
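
The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (the record does not state this, but the numbers are consistent with it) and that the coding sequence ends in a single stop codon. The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3499877
END_BP = 3503080
GENE_LENGTH_BP = 3204
PROTEIN_LENGTH_AA = 1067


def span_length(start, end):
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def expected_protein_length(gene_length_bp):
    """Number of codons in the CDS minus one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks print True when the fields agree with each other.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the fields agree: 3503080 - 3499877 + 1 = 3204 bp, and 3204 / 3 - 1 = 1067 aa.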


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.635771
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
GTGGGGCAGA AGGAGGCAGA CATCGCGACC GCGCCGACAT CCGAAACGCC CTGGGGGGAC 
CTGTTTTCGC AAGGCGGAGA CATGGGCCGA TTGATGGCGG CCTTTCCTTG GGACTCCTCG
CCGCTGGGGC CGGTCAGTGG CTGGCCAAGC CCCCTGCGCA CCGTCGTCGG CATGCTGTTG
CGCTCCAAAG CCCAAATCGT GCTGTTCTGG GGTCCGCACC TGGTGGCGCT TTACAACGAC
GCCTATGCCC CGACCATCGG CGGCAAGCAT CCCCGGGCCC TGGGCCGTCC GGCCCGGGAA
AACTGGGGAG AGCTTTGGGA TACCCTGGGT CCGATGCTCG GCCAAGTGCG GCTTACGGCC
CGGGCTTTTG GCGCGGACAA CTACCCCTTC CAGATAAACC GCCACGGCTT TCTCGAGCAG
GTTTATTTCG ACATTTCCTA TGATCCGGTT CCGGGCGCGG ACGGCGAGGT CGCCGGGGTG
TTCTGCACGG TCACCGAAAC GACCGCCCGT GTCTTCGCCG AGCGCCGACT GGCCGCCTTG
CGCGAATTAT CCATCGCCCT GTCGACCACC CTGGCGCGCA GCGATGGCCC CCTGTCGCTT
GGCGAGGCGC TGCCCAAGGC CGCCTTGGCG GCCATCGCCG GCCATTTGGC CCAAGACGTT
TCCCGCGCCG CCATCTACCG CCATGGCAGC CACGGCAAGC CTGTTCTGGT GGCGACCACC
GACCCCGCCC CCCGGACAAG ACCCCAGGCG ACGGGGACGA CCGAGGGATC GCCTTTTGCC
GAGATCCTGG AACGGGTTCG CCGCGAAGGG CGTCCCGTGC TGTGGGAAGG GGGCGGCGAT
GGCGGCCCCC AACGGCTGTT AGGCGTGCCG CTTACCGCCG AGCAGGGGCC CATGGGGGTT
CTGGTGGTGT CGGTCTGCCC GCTGGTCGCC GCCGATGCCG AATACCGCAA GGTCATCGAT
CTGGCGGCCG GGCAGATTTC CAGCGCCCTG ACCACGGCGG CCATTCTTGA ACGCGAGCGC
GAGCGCGCCG ACGATCTGAC CCGCGTCGCC GCCGATAACG CCCGGCTTTA CCGCGAGGCG
CAACGCGAGA TCGCCGACCG CCAAGCCATC GAAGCCCAAT TCCTCCATCT GACCGACACC
CTGGAAAAGA TGGTGGAGGA GCGAACCGGC GAACGCGACC GGCTGTGGCG GGTCTCGGGC
GAGTTGATGG CGGTCGGAAC CGCCGACAGC CATATCAAGG CCGTCAACCC GGCCTGGGAA
ACCCTGCTTG GCCATGACGA AAGGCTGCTG GTCGGCGCGC CGATGAGCCT GCTGGTCCAT
CCCGACGACC GCGAGCCGAG CACCGCCGCC TTGCGGGCCA TGGAACCCCA AGGCCCCCCC
TTGCGCTTCG AAAACCGTCT GCGCGATTCG GACGGGACCT ATCATTGGAT CGCCTGGACG
ATCGTTGCCC ACGGCGCCCT GTTCTATATG GTCGGCCGCG ACGTGACCGA GGAGAAACAG
GCCCGCGACG CATTGGCCGA GGCCCATCGC CAGTTGATCG CCCAGACCGC CGAGCGCGAA
AGGGCCGAGG AGGCCCTGCG CCAAGCCCAG AAGATGGAGG TCATCGGTCA GCTCACCGGC
GGCGTCGCCC ATGACTTCAA TAACCTGCTG ACCATCATCC TGGGCAATCT GGAAACCCTG
ATCCGCCACA TCGACGGAGC GTCGCCCCCC GACCCGGCGC GGGTAAGAAC CCTGGTCGAG
CGGGCGACCC TGGGCGCCGA GCGGGCCGCC GCCCTCACCC AGCGCCTTCT CGCCTTTTCC
CGACGCCAGC CCCTTGATCC GCGCCCCCTT GATCTCAACC GGCTGACCTT GGGGCTGATC
GACCTTTTGC ACCGGACGAT CGGCGAAAAA ATCGCCCTCG ACACCCGGCT GTCGATGGTG
CCGGTCGGCG TGCTCGCCGA TGCCAACCAG CTTGAAAGCG TCGTGCTCAA TCTGGCGCTC
AACGCCCGCG ACGCCATGCC CACGGGCGGA CGGTTGATCC TTGAAACCAC CGCCGATCTG
GTGATCGAGG CTCCGACCGC CGAAAAACGC CCGGCAAAGG GCAATCCCGC CGGCGCTCCG
CCGGGAACCT ATGCGGCCCT GCGCGTCACC GATACGGGAA AGGGCATGGA GGCCGGGGTT
CTGGCCCGGG TTTTCGATCC CTTTTTCACC ACCAAGGATG TCGGCCAGGG CACCGGCCTC
GGCCTGTCGC AGGCCTATGG CTTCATCAAG CAATCCGGCG GCCATATCGG CATAGAAAGC
GAACCGGGCC AGGGGACCAC CGTGACCATC TTGCTGCCCC GTCTTGAGGG GGCCCTGCCG
CTTCCCGAGG AGAGTGACCG CGACGAGACG CCCGCCGCCC TGGGCGGACT TGAGATCACG
CCCCCGCCGG GCGATCCGTC GATCCTGCTG CTGGTGGTCG AGGATGACCC GTCGGTGCGG
GCCCATTCCA CCGCGTCCTT GCGCGAACTG GGCTATCAGG TGATCGAAGC CGGGAATGGC
GCCGAGGCGC TTGGGCACTT GGCCGCCCAT CCCGATGTTG CCTTGATGTT CACCGATATC
GGTCTGCCCG GAGGAATGGA CGGTCGCCAA TTGGCCGAGC TGGCGCGGCG GAACCGCCCC
GATCTCGCCG TGGTGCTGAC CACCGGCTAC GCCCGCGACG CCCTGAACGG CAATGTTCCC
CTCGCCCCCC GGATGGCGCT GCTGACCAAG CCGTTTTCCT TCACCGCCCT GGCGCTCAAG
GTTCACGAGG TGCTTCCGCC CCGCCCCGCC CCCCAATCGT CCGCCCCGCA AGGCCCCGTA
ACCGTGTTGT TGGTCGAAGA CGATCCCTTG GTCAGCCTCG CCGCCGCCGA TCTGCTGCTG
TCGCTGGGCT GCGCGGTCGA TCAAGCTTAC AGCGTGGCCG ACGCCCTGGC CCAGGCCAGC
GCCGCCACCC CCGATGTCGC GGTGATCGAT ATCGGCTTGC CCGATGGACG CGGCGACGAT
CTGGCCCAAA CCCTGCGCCA GCGCCTTCCC GGTCTGCCGA TCGTCATCGC CTCGGGCTAT
GACCGTTCGG AAATCGACGC CCGCTTCGGC GACGATCCAC GCCTGCGGTT TCTGGGAAAA
CCCTATCTCG ACAGCCAGCT TGAAGCGGCG CTGACCGGCG CCCTTGGCCA CCCCCTGGAG
CGATCGACCG CCCCCCGGGC TTGA
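
The gene sequence is printed in blocks of 10 bases, so it has to be joined before any analysis. The sketch below shows one way to do that and to compute a GC fraction of the kind reported in the GC content field. The helper names are hypothetical, and only the first printed line of the sequence is used as input, so the result describes that 60-base fragment rather than the whole gene.

```python
def clean_sequence(text):
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First printed line of the gene sequence in this record.
FIRST_LINE = (
    "GTGGGGCAGA AGGAGGCAGA CATCGCGACC GCGCCGACAT CCGAAACGCC CTGGGGGGAC"
)

fragment = clean_sequence(FIRST_LINE)
print(len(fragment))                      # number of bases in the fragment
print(round(100 * gc_fraction(fragment)))  # GC percentage of the fragment
```

Passing the full 3204-base sequence through the same two functions would give the gene-wide value to compare with the reported 68%.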
 
Protein sequence
MGQKEADIAT APTSETPWGD LFSQGGDMGR LMAAFPWDSS PLGPVSGWPS PLRTVVGMLL 
RSKAQIVLFW GPHLVALYND AYAPTIGGKH PRALGRPARE NWGELWDTLG PMLGQVRLTA
RAFGADNYPF QINRHGFLEQ VYFDISYDPV PGADGEVAGV FCTVTETTAR VFAERRLAAL
RELSIALSTT LARSDGPLSL GEALPKAALA AIAGHLAQDV SRAAIYRHGS HGKPVLVATT
DPAPRTRPQA TGTTEGSPFA EILERVRREG RPVLWEGGGD GGPQRLLGVP LTAEQGPMGV
LVVSVCPLVA ADAEYRKVID LAAGQISSAL TTAAILERER ERADDLTRVA ADNARLYREA
QREIADRQAI EAQFLHLTDT LEKMVEERTG ERDRLWRVSG ELMAVGTADS HIKAVNPAWE
TLLGHDERLL VGAPMSLLVH PDDREPSTAA LRAMEPQGPP LRFENRLRDS DGTYHWIAWT
IVAHGALFYM VGRDVTEEKQ ARDALAEAHR QLIAQTAERE RAEEALRQAQ KMEVIGQLTG
GVAHDFNNLL TIILGNLETL IRHIDGASPP DPARVRTLVE RATLGAERAA ALTQRLLAFS
RRQPLDPRPL DLNRLTLGLI DLLHRTIGEK IALDTRLSMV PVGVLADANQ LESVVLNLAL
NARDAMPTGG RLILETTADL VIEAPTAEKR PAKGNPAGAP PGTYAALRVT DTGKGMEAGV
LARVFDPFFT TKDVGQGTGL GLSQAYGFIK QSGGHIGIES EPGQGTTVTI LLPRLEGALP
LPEESDRDET PAALGGLEIT PPPGDPSILL LVVEDDPSVR AHSTASLREL GYQVIEAGNG
AEALGHLAAH PDVALMFTDI GLPGGMDGRQ LAELARRNRP DLAVVLTTGY ARDALNGNVP
LAPRMALLTK PFSFTALALK VHEVLPPRPA PQSSAPQGPV TVLLVEDDPL VSLAAADLLL
SLGCAVDQAY SVADALAQAS AATPDVAVID IGLPDGRGDD LAQTLRQRLP GLPIVIASGY
DRSEIDARFG DDPRLRFLGK PYLDSQLEAA LTGALGHPLE RSTAPRA
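
The protein sequence begins with M even though the gene begins with GTG, which normally encodes valine. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG is an accepted start codon and an initiating codon is translated as methionine. The following is a minimal sketch of that rule in plain Python. The function and constant names are hypothetical, the codon table is built from the standard NCBI amino-acid string, and ambiguous bases such as N are not handled.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons listed for NCBI translation table 11.
START_CODONS_TABLE_11 = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(text):
    """Translate a CDS with table 11; the start codon is read as M."""
    seq = "".join(text.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    codons = [seq[i:i + 3] for i in range(0, usable, 3)]
    if not codons or codons[0] not in START_CODONS_TABLE_11:
        raise ValueError("sequence does not begin with a table 11 start codon")
    protein = ["M"]
    for codon in codons[1:]:
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First printed line of the gene sequence in this record.
FIRST_LINE = (
    "GTGGGGCAGA AGGAGGCAGA CATCGCGACC GCGCCGACAT CCGAAACGCC CTGGGGGGAC"
)
print(translate_cds(FIRST_LINE))  # MGQKEADIATAPTSETPWGD
```

The output matches the first 20 residues of the protein sequence shown above. Table 11 assigns the same amino acids as the standard code, so only the handling of the first codon differs in this sketch.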