Gene Rsph17029_2430 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_2430 
Symbol 
ID4897846 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp2562650 
End bp2564971 
Gene Length2322 bp 
Protein Length773 aa 
Translation table11 
GC content71% 
IMG OID640113028 
ProductRNA-binding S1 domain-containing protein 
Protein accessionYP_001044304 
Protein GI126463190 
COG category[K] Transcription 
COG ID[COG2183] Transcriptional accessory protein 
TIGRFAM ID[TIGR00426] competence protein ComEA helix-hairpin-helix repeat region 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.388315 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0422366 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACCG ACGCCATCCG CCGCATCCCC AAGCTGATCG CCGCCGAGAT CGCGGCCCGT 
CCCGATCAGG TCGCGGCCGC CATCGATCTT CTCGACGGCG GCGCGACCGT GCCCTTCGTC
GCCCGCTACC GCAAGGAGGC GACGGGCGGC CTTGACGACA CGCAGCTGCG CACGCTCTCC
GAGCGGCTGG CCTATCTGCG CGAGCTCGAG GCGCGGCGCG AGACGATCCT CGGCTCGATC
CGCGATCAGG GCAAGCTCAC GCCCGAGCTC GAGGCGCAGG TGGTGAAGGC CGCCACCAAG
TCCGAGCTCG AGGACATCTA TCTGCCCTAC AAGCCGAAGC GGCGCACAAA GGCGATGATC
GCGCGCGAGA ACGGGCTCGG GCCCCTGGCC GAGGCGATCC TCGCCGACCG TGCGGCGGTG
CCGGCGGAGC TGGCGCAGGC CTATGTTTCC GAGGCGGTGC CGGACGTGAA GGCGGCGCTT
GAGGGCGCGC GCGACATCGT GGCCGAGGGA CTGGCCGAGA ATGCGGATCT TCTGGGGCGG
CTGCGCGCCC ACATGAAGCA GGTGGGGCGG ATCGCGGCGA AGGTCGTCGA AGGGAAGGAG
GCCGAGGGCG CCAAGTTCTC GGACTATTTC GCCCATTCCG AGGCCTGGGC CACCGCGGCG
GGCCATCGGG TGCTCGCCAT GCTGCGGGGG CGCAACGAAG GCGTGCTGAC GCTCGATCTC
GAGGTCGATG CCGATGCGGC CCGCGGCGAG AGCCCGGCCG AGCGCATGGC CGGCCTCGCG
CTGCAGGCGG CCGGCAGGGG GCCGGGCGAC CAATGGCTGC GCGAGGCGGC CTCCTGGGCC
TGGCGGGTGA AGCTCAGAAC CTCGCTGACG CTGGACCTCA TGGCCGAGCT GCGCGAGCGG
GCCGAGGCCG AGGCGATCGG GGTCTTCGCG CGGAACCTCA AGGATCTGCT GCTGGCGGCC
CCCGCCGGCG GCAAGGTCAC GATGGGGATC GACCCGGGCA TCCGCACCGG CTGCAAGGTG
GCGGTGGTGG ATGCGACGGG GAAGGTTCTG GCCACCTCGA CCGTCTATCC GTTCCAGCCC
AAGAACGACC TGCGCGGGGC GCAGATCGAG CTGATGAAGC TGATCCGCGC CCATGGCGTG
ACGCTGATCG CCATCGGCAA CGGCACCGCG AGCCGCGAGA GCGAGAAGAT GGTGGCCGAC
CTTCTGGCGG CGCTGCCGGG AGACGCGCCG AAGCCCGTGA AGGTGATCGT GAGCGAGGCC
GGCGCCTCGG TCTATTCGGC GAGCGCGCTG GCGGCGGCCG AGTTCCCCGA TCTCGACGTC
AGCCTGCGCG GCGCGGTCTC GATCGCGCGG CGGCTGCAGG ATCCGCTGGC AGAGCTGGTG
AAGATCGAGC CGAAGTCGAT CGGCGTGGGC CAGTATCAGC ATGATGTGGA CCAGCACCGT
CTGGGCCGCT CGCTGGAGGC AGTGGTCGAG GATGCGGTGA ACGCGGTGGG GGTGGATCTG
AACACGGCCT CGGCGCCGCT TCTGGCGCGC GTCTCGGGCG TGGGGCCTGG GCTGGCCGAG
GCCATCGTGC AGCACCGCAA CGCGAACGGT CCCTTCGCCA AGCGCCGTGA CCTGCTGAAG
GTCGCGCGTC TGGGCCCCCG CGCCTTCGAG CAGGCGGCGG GCTTCCTGCG CATCCCGGAC
GGGGCCGAGC CGCTCGACGC CTCGTCCGTT CACCCCGAGG CCTACGATGT GGCGCGCCGG
ATCGTGGCCG CCTGCGGGCG CGACCTGCGC AGCCTGATGG CCGAGCCGCA GCGGCTGCGG
GCGCTCGACC CGCAGGACTT CACCGACGAG CGCTTCGGCC TGCCGACCGT GAGGGACATC
CTCGCCGAGC TGGAGAAGCC CGGCCGCGAC CCGCGCCCGA CCTTCAAGAC CGCGACCTTC
ACCGACGGGG TCGAGGAGAT CTCGGACCTG AAGCCGGGGA TGCTTCTGGA AGGGACAGTG
ACCAACGTCG CGGCCTTCGG CGCCTTCGTC GATGTGGGGG TGCATCAGGA CGGGCTCGTG
CATGTGAGCC AGCTCGCCGA CCGCTTCGTG AAGGATCCGC ATGAGGTGGT GAAGGCAGGC
GACGTGGTGA AGGTGCGGGT GGTCGAGGTG GATGTGGCGC GCAAGCGCAT CGGCCTCACC
ATGCGCAAGG ACAGCGCCGA CGCCCGCGCG CAGGCCTCCG AGCGCCGGAG CGCCGAACCG
CCGCGCAAGG GCGGCAAGCC CTTGCCCCGG CAGGCCGCAC CGCGCGAGAC GAGCGCGGGC
AACCCGTTCG CGGATGCGCT GAAGGGCCTG CGTCGGGAGT GA
 
Protein sequence
MSTDAIRRIP KLIAAEIAAR PDQVAAAIDL LDGGATVPFV ARYRKEATGG LDDTQLRTLS 
ERLAYLRELE ARRETILGSI RDQGKLTPEL EAQVVKAATK SELEDIYLPY KPKRRTKAMI
ARENGLGPLA EAILADRAAV PAELAQAYVS EAVPDVKAAL EGARDIVAEG LAENADLLGR
LRAHMKQVGR IAAKVVEGKE AEGAKFSDYF AHSEAWATAA GHRVLAMLRG RNEGVLTLDL
EVDADAARGE SPAERMAGLA LQAAGRGPGD QWLREAASWA WRVKLRTSLT LDLMAELRER
AEAEAIGVFA RNLKDLLLAA PAGGKVTMGI DPGIRTGCKV AVVDATGKVL ATSTVYPFQP
KNDLRGAQIE LMKLIRAHGV TLIAIGNGTA SRESEKMVAD LLAALPGDAP KPVKVIVSEA
GASVYSASAL AAAEFPDLDV SLRGAVSIAR RLQDPLAELV KIEPKSIGVG QYQHDVDQHR
LGRSLEAVVE DAVNAVGVDL NTASAPLLAR VSGVGPGLAE AIVQHRNANG PFAKRRDLLK
VARLGPRAFE QAAGFLRIPD GAEPLDASSV HPEAYDVARR IVAACGRDLR SLMAEPQRLR
ALDPQDFTDE RFGLPTVRDI LAELEKPGRD PRPTFKTATF TDGVEEISDL KPGMLLEGTV
TNVAAFGAFV DVGVHQDGLV HVSQLADRFV KDPHEVVKAG DVVKVRVVEV DVARKRIGLT
MRKDSADARA QASERRSAEP PRKGGKPLPR QAAPRETSAG NPFADALKGL RRE