Gene Rsph17029_2266 details

Gene Information

Locus tag: Rsph17029_2266
Symbol:
ID: 4897368
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009049
Strand:
Start bp: 2398566
End bp: 2402243
Gene Length: 3678 bp
Protein Length: 1225 aa
Translation table: 11
GC content: 62%
IMG OID: 640112860
Product: pentapeptide repeat-containing protein
Protein accession: YP_001044140
Protein GI: 126463026
COG category:
COG ID:
TIGRFAM ID:
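
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length counts one terminal stop codon (the listed gene sequence ends in TGA). Neither convention is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2398566
end_bp = 2402243
gene_length_bp = 3678
protein_length_aa = 1225

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS includes exactly one stop codon, which is not translated.
codons, remainder = divmod(span_bp, 3)
coding_codons = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("codons minus stop match protein length:", coding_codons == protein_length_aa)
```

Under these assumptions all three checks hold for the values listed in this record.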


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.468941
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGAGGCGGA CAGATAAATA CTTTAACATA GCCAATATCT TGGTTGCTGC TGCGGGCGTT 
GCCACTGGCA CGGGCAGTGT CGCTATTGCT GCTGCCGGAA TGACAGCGGT GGACTCGCTG
ATTAAGATTG CCCGGAGCTG GCCACGAAAT ACCGACACTA TCATTTGGAA GGTCTCGGCC
CAACTGAAAC GGTCGCTTTC TGACATGCAC CTCTCCGACG AGCAGCGCCT GCTCATTCTT
CAAATGATCG AACAAGTCGA CCCGACACCA AACGCCATAA TAGCCTGCGC CCGTGATGCC
GACTGTCTAA CCGAACGCCT GTTGGACAGC TTGCAAGCTG ATCCCGCCCA CAGCACTATT
GCGGCACAGG ATGGCTTTCG GCAGGTGGTT TCGCCAATCC TCCGGCAACT GCTACTGGAC
CCCGATATCT GCGACGCTTT GCGCCCCATC CACGAGCAGG CAGTGGCGCA TGGCCTGGCG
GTGATCCAGG CCAAGCAAGA GGAGCAGACG ACTGTTCAGA AGACCCAGAC GGACCAATTG
AGCCGGATCG AGGAGATGAT CGCAGTTGCG CTGGGCTACA TTGACACCAC GCCTGATGCG
CTAGCCGCTC TGCAGGCAAT TGGCAATAGT CTGCCAAACT TACCTGGAAT CGTCGAGCGC
TACCAGGCAG CGCTGGCTAG AGGTGACGAG GCCCACGCGC GCCAAACGCT CGAGTCAGCG
TTGGCCGATG CAACTGCACA CCGGATGAAA ACAGCCGAAG CAAAGGAGGT AGCAGATCGC
CAATACGTCG ACGCCGCTGA CGCCGAGGCG CGGCTTACGG AAATCGATGC CGAACACAAA
TGGAAGGGCA AGGATCGCAT CGGCGCTATA CACCGCTTGG GCCACGCGCT GTCTATCGCC
GCAAGCACCG CTCTTCGGGG CGAGCTCGCG ACTCTACGGC AACGTCGATT CAATATCGTG
CTGCATGCGC AGCGCGGCTA TCCGGACGTC CGCGACTTAA TGCGGTTGCC AGATATGCCT
GCGCCAGATG TGGTTACCTA CTCCATCCTA ATGGCAAAAG CCCCCGACTT CACCGGGGCC
GAGGGCGTGC GGGCCGAGAT GGTGGCCGCC GGGATCAAGC CGAACGAAGT CACCTTCAAC
ACCCTCGTCG CCAAGGCCCC CGACTTCGCC AGAGCCGAGG CCGTGCGGGC CGAGATGGTG
GCCGCTGGGA TCAAGCCGAA CGAATTCACC TTCAACACCC TCATCGCCAA GGCCCCCGAC
TTCACCGGGG CCGAGGGCGT GCGGGCCGAG ATGGTGGCCG CTGGGATCAA GCCGAACGAA
GTCACCTTCA ACACCCTCAT CGCCAAGGCC GCCGACTTCG CCAGAGCCGA GGGCGTGCGG
GCCGAGATGG TGGCCGCTGG GATCAAGCCG AACGAATTCA CCTTCAACAC CCTCATCGCC
AAGGCCCCCG ACTTCACCGG GGCCGAGGGC GTGCGGGCCG AGATGGTGGC CGCTGGGATC
AAGCCGAACG AAGTCACCTT CAACACCCTC ATCGCCAAGG CCGCCGACTT CGCCAGAGCC
GAGGCCGTGC GGGCCGAGAT GGTGGCCGCT GGGATCAAGC CGAACGAAGT CACCTTCAAC
ACCCTCATCG CCAAGGCCCC CGACTTCACC GGGGCCGAGG CCGTGCGGGC CGAGATGGTG
GCCGCTGGGA TCAAGCCGAA CGAAGTCACC TTCAACACCC TCATCGCCAA GGCCGCCGAC
TTCGCCAGAG CCGAGGCCGT GCGGGCCGAG ATGGTGGCCG CTGGGATCAA GCCGAACGAA
GTCACCTTCA ACACCCTCAT CGCCAAGGCC CCCGACTTCA CCGGGGCCGA GGCCGTGCGG
GCCGAGATGG TGGCCGCTGG GATCAAGCCG AACGAAGTCA CCTTCAACAC CCTCATCGCC
AAGGCCGCCG ACTTCGCCAG AGCCGAGGCC GTGCGGGCCG AGATGGTGGC CGCTGGGATC
AAGCCGAACG AAGTCACCTT CAACACCCTC GTCGCCAAGG CCCCCGACTT CACCGGGGCC
GAGGGCGTGC GGGCCGAGAT GGTGGCCGCT GGGATCAAGC CGAACGAAGT CACCTTCAAC
GCCCTCATCG CCAAGGCCCC CGACTTCGCC AGAGCCGAGG CCGTGCGGGC CGAGATGGTG
GCCGCTGGGA TCAAGCCGAA CGAAGTCACC TTCAACGCCC TCGTCGCCAA GGCCCCCGAC
TTCGCCAGAG CCGAGGCCGT GCGGGCCGAG ATGGTGGCCG CTGGGATCAA GCCGAACGAA
GTCACCTTCA ACACCCTCAT CGCCAAGGCC CCCGACTTCA CCGGGGCCGA GGGCGTGCGG
GCCGAGATGG TGGCCGCTGG GATCAAGCCG AACGAAGTCA CCTTCAACAC CCTCGTCGCC
AAGGCCCCCG ACTTCACCGG GGCCGAGGCC GTGCGGGCCG AGATGGTGGC CGCTGGGATC
AAGCCGAACG AAGTCACCTT CAACACCCTC GTCGCCAAGG CCCCCGACTT CACCGGGGCC
GAGGGCGTGC GGGCCGAGAT GGTGGCCGCT GGGATCAAGC CGAACGAAGT CACCTTCAAC
GCCCTCATCG CCAAGGCCCC CGACTTCGCC AGAGCCGAGG CCGTGCGGGC CGAGATGGTG
GCCGCTGGGA TCAAGCCGAA CGAATTCACC TTCAACACCC TCATCGCCAA GGCCGCCGAC
TTCGCCAGAG CCGAGGCCGT GCGGGCCGAG ATGGTGGCCG CTGGGATCAA GCCGAACGAA
TTCACCTTCT CCACCCTCAT CGCCAAGGCC CCCGACTTCA CCGGGGCCGA GGGCGTGCGG
GCCGAGATGG TGGCCGCTGG GATCAAGCCG AACGAATTCA CCTTCAACAC CCTCATCGCC
AAGGCCCCCG ACTTCGCCAG AGCCGAGGCC GTGCGGGCCG AGATGGTGGC CGCTGGGATC
AAGCCGAACG AATTCACCTT CTCCACCCTC GTCGCCAAGG CCCCCGACTT CACCGGGGCC
GAGGCCGTAC GGGCCGAGAT GGTGGCCGCT GGGATCAAGC CGAACGAATT CACCTTCAAC
ACCCTCATCG CCAAGGCCCC CGACTTCGCC AGAGCCGAGG CCGTGCGGGC CGAGATGGTG
GCCGCTGGGA TCAAGCCGAA CGAATTCACC TTCTCCACCC TCATCGCCAA GGCCCCCGAC
TTCACCAGAG CCGAGGCCGT GCGGGCCGAG ATGGTGGCCG CTGGGATCAA GCCGAACGAA
GTCACCTTCA ATATCCTGAT TTCAAGAGCC CCAACGCAAT TGGTGGCAAT GAGATTATTT
CGCGACATGC TTAAGGCGAA TGTGAAGCCG AATAAGCAGC TAGCCACATC GCTGATGAAA
TCAGTTCCGA AGATCACGGA CGCACTCAAG ATTGCGCAAG ATTTTCGCCG GACTGGAGGG
CAAATGGACG TGAAAATGTT TGGCTGCCTT CTGTTTAAGT CGAAGGAGTT TCGCTATCTG
GAGCCAATAT TCTATAAGAT GCTCGAAAAA GGGCCGAAAC CTGATCGGAT GATTTGGCAG
CATGTAATTA TCCACGCCCC CGACAAATCT ACCCGCTTGA AGTTGATTCG AGAAATGCGC
AGCGAAGGAT TGGAACCGGA CGCAATTATG TGGCGGCGCT TATCTAGCTA CGGCATCTAC
GCCGAAGACT TCGATTGA
 
Protein sequence
MRRTDKYFNI ANILVAAAGV ATGTGSVAIA AAGMTAVDSL IKIARSWPRN TDTIIWKVSA 
QLKRSLSDMH LSDEQRLLIL QMIEQVDPTP NAIIACARDA DCLTERLLDS LQADPAHSTI
AAQDGFRQVV SPILRQLLLD PDICDALRPI HEQAVAHGLA VIQAKQEEQT TVQKTQTDQL
SRIEEMIAVA LGYIDTTPDA LAALQAIGNS LPNLPGIVER YQAALARGDE AHARQTLESA
LADATAHRMK TAEAKEVADR QYVDAADAEA RLTEIDAEHK WKGKDRIGAI HRLGHALSIA
ASTALRGELA TLRQRRFNIV LHAQRGYPDV RDLMRLPDMP APDVVTYSIL MAKAPDFTGA
EGVRAEMVAA GIKPNEVTFN TLVAKAPDFA RAEAVRAEMV AAGIKPNEFT FNTLIAKAPD
FTGAEGVRAE MVAAGIKPNE VTFNTLIAKA ADFARAEGVR AEMVAAGIKP NEFTFNTLIA
KAPDFTGAEG VRAEMVAAGI KPNEVTFNTL IAKAADFARA EAVRAEMVAA GIKPNEVTFN
TLIAKAPDFT GAEAVRAEMV AAGIKPNEVT FNTLIAKAAD FARAEAVRAE MVAAGIKPNE
VTFNTLIAKA PDFTGAEAVR AEMVAAGIKP NEVTFNTLIA KAADFARAEA VRAEMVAAGI
KPNEVTFNTL VAKAPDFTGA EGVRAEMVAA GIKPNEVTFN ALIAKAPDFA RAEAVRAEMV
AAGIKPNEVT FNALVAKAPD FARAEAVRAE MVAAGIKPNE VTFNTLIAKA PDFTGAEGVR
AEMVAAGIKP NEVTFNTLVA KAPDFTGAEA VRAEMVAAGI KPNEVTFNTL VAKAPDFTGA
EGVRAEMVAA GIKPNEVTFN ALIAKAPDFA RAEAVRAEMV AAGIKPNEFT FNTLIAKAAD
FARAEAVRAE MVAAGIKPNE FTFSTLIAKA PDFTGAEGVR AEMVAAGIKP NEFTFNTLIA
KAPDFARAEA VRAEMVAAGI KPNEFTFSTL VAKAPDFTGA EAVRAEMVAA GIKPNEFTFN
TLIAKAPDFA RAEAVRAEMV AAGIKPNEFT FSTLIAKAPD FTRAEAVRAE MVAAGIKPNE
VTFNILISRA PTQLVAMRLF RDMLKANVKP NKQLATSLMK SVPKITDALK IAQDFRRTGG
QMDVKMFGCL LFKSKEFRYL EPIFYKMLEK GPKPDRMIWQ HVIIHAPDKS TRLKLIREMR
SEGLEPDAIM WRRLSSYGIY AEDFD