Gene Rsph17025_0405 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_0405 
Symbol 
ID5082493 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009428 
Strand
Start bp399335 
End bp401653 
Gene Length2319 bp 
Protein Length772 aa 
Translation table11 
GC content70% 
IMG OID640481957 
ProductRNA-binding S1 domain-containing protein 
Protein accessionYP_001166616 
Protein GI146276457 
COG category[K] Transcription 
COG ID[COG2183] Transcriptional accessory protein 
TIGRFAM ID[TIGR00426] competence protein ComEA helix-hairpin-helix repeat region 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACCG ACGCCATCCG CCGCATTCCC AAGCTGATCG CCGCCGAGAT CGCGGCCCGA 
CCCGAGCAGG TCTCGGCGGC CATCGAGCTT CTCGACGGCG GCGCGACCGT TCCCTTCGTC
GCCCGCTACC GCAAGGAGGT GACGGGCGGC CTCGACGACA CGCAGCTGCG CACGCTCTCC
GAGCGGCTGG CCTATCTGCG CGAGCTGGAG GCGCGGCGCG AGACGATCCT GTCCTCGATC
CGCGATCAGG GCAAGCTGAC GGCCGAGCTG GAACAGGCCG TGGGCAAGAC CACGACCAAG
TCCGAGCTGG AAGACATCTA CCTGCCCTAC AAGCCCAAGC GGCGGACCAA GGCGATGATC
GCGCGCGAGA ACGGCCTCGG GCCGCTGGCC GAGGCGATCC TTGCCGACCG TGCGGCGGTG
CCGGCCGATC TGGCGAAGGC CTTTGTCACC GAAGCCGTGC CGGACGTGAA GGCCGCGTTG
GAGGGCGCGC GCGACATCCT GGCCGAGGGG CTGTCCGAGA ACGCGGACCT GCTGGGGCGG
CTCCGCGCCC ACATGAAGCA GGTCGGCCGC GTCTCGGCCA AGGTGGTCGA GGGCAAGGAG
GCCGAGGGCG CCAAGTTTTC GGACTATTTC GCCCATTCCG AGGCTTGGGC CACCGCTGCG
GGCCACCGGG TGCTGGCCAT GCTGCGTGGG CGGAACGAGG GCGTGCTGGT GCTCGATCTC
GAAGTCGATG CCGACGCGGC CCGCGGCGAA AGCCCGGCCG AGCGGATGGC CGGCATGGCG
CTTCAGGCGG CGGGCAAGGG GCCGGGCGAT CAGTGGCTGC GCGAGGCGGC GTCCTGGGCG
TGGCGGGTGA AGCTGAAGAC CTCGCTGACG CTGGATCTCA TGGCGGAGTT GCGCGAGCGG
GCCGAGGCCG AGGCGATCGG CGTCTTTGCG CGGAACCTCA AGGACCTGCT GCTGGCTGCG
CCGGCGGGGG GCAAGGTCAC GATGGGGATC GACCCGGGCA TCCGCACCGG CTGCAAGGTT
GCGGTGGTGG ATGCGACGGG CAAGCTGCTC GCCACCTCGA CCATCTATCC GTTCCAGCCG
AAGAACGACC TGCGCGGCTC GCAGGTGGAG CTTCTGAAGC TGATCCGCGC GCATGGGGTG
ACGCTGATCG CCATCGGCAA CGGAACGGCC AGCCGCGAGA GTGAGAAGAT GGTGGCCGAC
CTGCTGGCCG CCATGCCCGC CGAGATGCCG AAGCCCGTGA AGGTGATCGT CAGCGAGGCC
GGCGCCTCGG TCTATTCCGC GAGCGCGCTG GCGGCGGCCG AGTTCCCGGA TCTCGACGTG
TCCCTGCGCG GCGCGGTCTC GATCGCGCGC CGGCTGCAGG ATCCGCTGGC GGAACTGGTG
AAGATCGAGC CGAAGTCGAT CGGCGTCGGC CAGTATCAGC ATGATGTGGA CCAGCATCGC
CTCGGACGTT CGCTGGAGGC GGTTGTGGAA GACGCGGTGA ACGCGGTGGG GGTGGATCTG
AACACCGCCT CGGCGCCGCT TCTGGCGCGC GTCTCGGGCG TGGGGCCGGG GCTGGCCGAG
GCGATCGTGC AGCACCGCAA CACGAACGGC CCCTTTGCGA AGCGCCGCGA CCTGCTGAAG
GTCGCGCGCC TCGGCCCCCG CGCCTTCGAG CAGGCGGCGG GCTTCCTGCG CATCCCGAAC
GGGACCGAGC CGCTCGATGC CTCGAGCGTC CACCCCGAGG CCTATGACGT GGCGCGCAAG
ATCGTGGCCG CCTGCGGGCG CGACCTGCGC AGCCTGATGG CCGAGCCGCA GCGGCTGCGC
GCGCTCGATC CGCAGGACTT CACCGACGAC CGCTTCGGCC TGCCGACCGT CCGCGACATC
CTTGCGGAAC TGGAAAAGCC CGGCCGCGAC CCGCGCCCGA CCTTCAAGAC CGCGACCTTC
ACCGAAGGGG TCGAGGAGAT TTCGGACCTC AAGGTGGGGA TGCTGCTGGA AGGGACGGTG
ACGAACGTCG CGGCCTTCGG CGCCTTCGTC GATGTGGGCG TGCATCAGGA CGGGCTGGTG
CATGTCAGCC AGCTCGCCGA CCGTTTCGTG AAGGATCCGC ACGAGGTGGT GAAGGCCGGC
GACGTGGTGA AGGTCCGCGT GGTCGAGGTG GACGTGGCGC GCAAGCGCAT CGGGCTCACC
ATGCGCAAGG ACAGCGCCGA TGCCCGCGCG CAGGCGTCGG AGCGAAAGTC CGAGCCGCCG
CGCAAGGGCG GCAAGCCGCT GCCCCGGCAG GCCGCTCCGC GCGAGAGTGG CGCGAGCAAT
CCCTTTGCCG ATGCGCTGAA GGGATTCAGG CGGGACTGA
 
Protein sequence
MSTDAIRRIP KLIAAEIAAR PEQVSAAIEL LDGGATVPFV ARYRKEVTGG LDDTQLRTLS 
ERLAYLRELE ARRETILSSI RDQGKLTAEL EQAVGKTTTK SELEDIYLPY KPKRRTKAMI
ARENGLGPLA EAILADRAAV PADLAKAFVT EAVPDVKAAL EGARDILAEG LSENADLLGR
LRAHMKQVGR VSAKVVEGKE AEGAKFSDYF AHSEAWATAA GHRVLAMLRG RNEGVLVLDL
EVDADAARGE SPAERMAGMA LQAAGKGPGD QWLREAASWA WRVKLKTSLT LDLMAELRER
AEAEAIGVFA RNLKDLLLAA PAGGKVTMGI DPGIRTGCKV AVVDATGKLL ATSTIYPFQP
KNDLRGSQVE LLKLIRAHGV TLIAIGNGTA SRESEKMVAD LLAAMPAEMP KPVKVIVSEA
GASVYSASAL AAAEFPDLDV SLRGAVSIAR RLQDPLAELV KIEPKSIGVG QYQHDVDQHR
LGRSLEAVVE DAVNAVGVDL NTASAPLLAR VSGVGPGLAE AIVQHRNTNG PFAKRRDLLK
VARLGPRAFE QAAGFLRIPN GTEPLDASSV HPEAYDVARK IVAACGRDLR SLMAEPQRLR
ALDPQDFTDD RFGLPTVRDI LAELEKPGRD PRPTFKTATF TEGVEEISDL KVGMLLEGTV
TNVAAFGAFV DVGVHQDGLV HVSQLADRFV KDPHEVVKAG DVVKVRVVEV DVARKRIGLT
MRKDSADARA QASERKSEPP RKGGKPLPRQ AAPRESGASN PFADALKGFR RD