Gene Rsph17029_1849 details


Gene Information

Locus tag: Rsph17029_1849
Symbol:
ID: 4897601
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009049
Strand:
Start bp: 1951458
End bp: 1954754
Gene Length: 3297 bp
Protein Length: 1098 aa
Translation table: 11
GC content: 64%
IMG OID: 640112441
Product: hypothetical protein
Protein accession: YP_001043725
Protein GI: 126462611
COG category: [R] General function prediction only
COG ID: [COG1483] Predicted ATPase (AAA+ superfamily)
TIGRFAM ID:
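
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon (the gene sequence listed below ends in TAG). The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced only for illustration.

```python
# Values copied from the Gene Information table.
START_BP = 1951458
END_BP = 1954754
GENE_LENGTH_BP = 3297
PROTEIN_LENGTH_AA = 1098


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop.

    Assumes the CDS length includes the stop codon, so one codon is
    subtracted from the total codon count.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should hold if the record is internally consistent.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the reported gene length matches the coordinate span, and the reported protein length matches the codon count minus the stop codon.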


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCAAGT CTACCCGTCA GCATGTGTTC GAGGGCATGG AGCTTCTGCC CGAGGCGTTG 
ATCCCCTTTG TCGAGAAGCG GCTGGAAAGC TCGCTGCAGG GCCATTGGCA GGTGCAGGTG
GTCGAGCGCG TCCGGGGCTT GCGGCCCAAC GGCAACGGCC AGGTGAACTG GGACCAGCAG
GGGCTGCTGC AGGCCATGAT GGCGTTCTGG AAGGACGCCT TCGCGATGGT GCTGGGGCAT
CCGGAACGGT CCTACGTCTC CGAGCTGCTC GACGTGCGCA ACAAGCTCTC GCACAACGAG
GCCTTCACCT ATGACGACGC CGAGCGCGCG CTCGACACGA TGCGGCGGCT GCTGGAATCG
GTTAGCGCCA AGGAGACCGC GGAGAAGATC AGCGCCTCGC GCGATACGAT CCTGCGCACG
AAATATGCCG AGCTGGCCCG GAACGAGGAG CGGCGCAGGA CGCAGCGCTC GGACATCTCG
GTGGACACGG TGGCCGGGCT GATGCCGTGG CGCGAGGTGG TCGAGCCGCA CCAGGACGTG
GCCACGGGCG AATTTCAGCA GGCCGAGTTC GCCGCCGACC TGGCCAAGGT GCATAACGGC
AGCGCGCCGT CCGAATACCG CAACCCGCGC GAGTTCTTCG CCCGGACCTA TCTGACCGAG
GGGCTCAGCA CGCTGCTGGT CGGCGCGGCC AAGCGGCTGG GCTCTGGCGG CGGCGATCCA
GTCGTCGAGC TGCAGACGAA CTTCGGCGGC GGCAAGACGC ACTCGATGCT GGCGCTCTAC
CACATGGTGA GCGGGACGCC GGTGGAGGAT CTGCCGGGGC TCGACCAGCT TCTGTCGCGG
AGCGGGCTGA CGGTGCCGGG CAAGATCAAC CGCGCGGTGC TGGTGGGCAC CTCGCGCGGT
CCGCAGGACG TGATCTCGCT TGAGGGCGGC CGGAAGATCC GCACGACCTG GGGCGAGTTG
GCGTGGCAGC TGGGCGGCGC CGAGGCCTTC GAAATGCTAG CCGAGAACGA CGAGCGCGGG
ATCGCGCCGG GATCGAACCT GCTGGAAGCG CTGTTCAAGA AGTACGCGCC CGCGCTGATC
CTGATCGACG AGTGGGTCGC CTACCTGCGA CAGATCTACA AGGTGGAGGG GCTGCCGTCC
GGCTCGTTCG ACGCGAACCT GTCCTTTGTG CAGTCCCTGA CCGAGGCCGT GAAGGCGAGC
CCCGGCGTGC TGCTGGTGGC GTCGCTGCCC GCCTCGCAGA TCGAGGTCGG CGGCGAAGGC
GGACAGGAAG CGTTGGCGCG TCTCAAGCAG ACGTTCAGCC GCGTGGAATC GTCCTGGCGC
CCCGCCAGCC AGGAGGAAAG CTACGAGATC GTCCGCCGCC GCCTGTTCAA GGAAATCCCT
GGCGACAAGT TCCACCATCG CGACAACACG CTGAAGCAGT TCGCCAAGCT GTACCGGGAG
AACGCGAACG ACTTCCCAAA CGGATGCTCC GACGAGGATT ACCGGCGGAA GCTGGAAAAG
GCCTACCCAA TCCATCCGGA GCTGTTCGAC CAGCTCTACA CTAGCTGGGG CTCGCTTGAA
AAATTTCAGC GCACGCGTGG CGTGCTGCGC CTGATGGCGC AGGTGATCCA CGAACTTTGG
ATGGGCAACG ATCCGTCGGT GATGATCATG CCCGGCAGCG TTGCAATCAG CTCGGCGCGC
GTCGAGCCCG AGCTGCTGCA CTATCTCGAC TCCAGCTGGC AATCGATCAT CGCGGGCGAC
GTCGATGGCG TGACGTCAAC GCCGTACAAG ATTGATCAGT CAGCCCCCAA CCTGAACCGG
TACTCGGCGA CCCGTCGCGT TGCACGGGCG GTCTTCATGG GAACAGCGCC AACGCACGGT
CAGGAGAACA AGGGGCTCGA CGACAAGCAG ATCAACCTTG GCGTCGTCCA GCCCGGTGAA
CGTCCGGCGA TCTTTGGCGA CGCCCTGCGC CGCCTCGCAA ACCAGGCTAA GTTCATGCAC
AGTGACCTTG GCCGATACTG GTACTCGATG TCGGCCAGCC TCAACAGACT GGCCGCCGAC
CGCGCCGCTC AGTTCGAGGA AGCACTCGTC CTCCACGAGA TCGACAAGGC GCTCAGCAGC
TACATCAACG GCCTCGCGGA TCGCGGCCAC TTCGACACCG TTCAGGTCGC ACCTGGCAGC
TCGGCCGATA TCCCCGACGA GCCCGGCGGT GTGCGCGCGG TGGTTCTGGG CGTGGCACAC
CCTCACAGCG GTCGGGAGGG ATCAGAGGCG CTGGCGGAGG CGAAGGATGT CATGATGCAG
CGGGGCAGTA CGCCGCGCGT CTACCGGAAC ATGCTGGTCT TCCTCGCCGC CGAGCAACGC
CAGCTGGACA ATCTCAAAGC CGCCCAGCGT GCCGCTCTGG CATGGGCCGA GATTGTTCGA
GAAACGAAAC GGCTCAACCT GACGCAGAGC GACAGCGCGC TGGCGGAAGC GAAACTGAAA
GAGGCAACCG AGACCCTGAA GACACGGATG AAAGAAGCCT GGTGCTATCT GATCTATCCG
GTTCAGGAGA GCGCCCAATC CGATGTGGAG TGGATGTCGG CAAAGGTTCC CGCGCAGGAC
GGGCTGCTCG CTCGCGCCAG CAAGAAACTT GTGAGCGATC AAGGGATCTG GCCCGAGCTT
GGCCCTGACA ACCTGAATCG TCAGCTTGAG AAGTACATCT GGAACGGCAA GCCGCATCTG
CATCTCAAGG ATCTCTGGGA GTATCTGAAC CGTTACACCT ACCTTCCGCG CGTCAAGAAC
AGGGCGGTTC TCTCGAAGGC GGTGCACGCT GCCGTCAGCG GGATGCTGCC TGGCCCCTTC
GCCTATGCAG AGCGCTGGGA CGAGGCGAAG GGGTCCTATG TCGGTCTCGC AATTTCAGGA
GCCTCGAACG CTCAGGTCGT GATCGACAGT GAATCGGTCA TCATCAAGCC AGATGTTGCA
GAGCAGTACA GGCAGAAGCA GACCGCAGCG GCACCAGCAG AGGGCCCGGC ACCCACCGTA
ACACATGGTC CCGAAACGAC TCAGCAACCC GACTCTGGCA CACCGGCAGC CACACCGACC
GAGCAAAAGC CCACCCGTTT CCACGGGACG GTGATGATTT CACCCGAGCG GCCGGCGCGC
GACATCCACC AGATCGTCGA AGCGATCATC GAGCAGCTGA CTACGCTGCC TGGTGCTGAT
GTCTCTATCA AGCTTGAGAT CGACGCAGAG GTGTCTTCTG GCCTTGATCG CGCAAAGGTC
AGGACGCTGG TCGAAAATGC GACGACGCTC GGGTTCATCG ACAAAGCTGT CAAATAG
 
Protein sequence
MAKSTRQHVF EGMELLPEAL IPFVEKRLES SLQGHWQVQV VERVRGLRPN GNGQVNWDQQ 
GLLQAMMAFW KDAFAMVLGH PERSYVSELL DVRNKLSHNE AFTYDDAERA LDTMRRLLES
VSAKETAEKI SASRDTILRT KYAELARNEE RRRTQRSDIS VDTVAGLMPW REVVEPHQDV
ATGEFQQAEF AADLAKVHNG SAPSEYRNPR EFFARTYLTE GLSTLLVGAA KRLGSGGGDP
VVELQTNFGG GKTHSMLALY HMVSGTPVED LPGLDQLLSR SGLTVPGKIN RAVLVGTSRG
PQDVISLEGG RKIRTTWGEL AWQLGGAEAF EMLAENDERG IAPGSNLLEA LFKKYAPALI
LIDEWVAYLR QIYKVEGLPS GSFDANLSFV QSLTEAVKAS PGVLLVASLP ASQIEVGGEG
GQEALARLKQ TFSRVESSWR PASQEESYEI VRRRLFKEIP GDKFHHRDNT LKQFAKLYRE
NANDFPNGCS DEDYRRKLEK AYPIHPELFD QLYTSWGSLE KFQRTRGVLR LMAQVIHELW
MGNDPSVMIM PGSVAISSAR VEPELLHYLD SSWQSIIAGD VDGVTSTPYK IDQSAPNLNR
YSATRRVARA VFMGTAPTHG QENKGLDDKQ INLGVVQPGE RPAIFGDALR RLANQAKFMH
SDLGRYWYSM SASLNRLAAD RAAQFEEALV LHEIDKALSS YINGLADRGH FDTVQVAPGS
SADIPDEPGG VRAVVLGVAH PHSGREGSEA LAEAKDVMMQ RGSTPRVYRN MLVFLAAEQR
QLDNLKAAQR AALAWAEIVR ETKRLNLTQS DSALAEAKLK EATETLKTRM KEAWCYLIYP
VQESAQSDVE WMSAKVPAQD GLLARASKKL VSDQGIWPEL GPDNLNRQLE KYIWNGKPHL
HLKDLWEYLN RYTYLPRVKN RAVLSKAVHA AVSGMLPGPF AYAERWDEAK GSYVGLAISG
ASNAQVVIDS ESVIIKPDVA EQYRQKQTAA APAEGPAPTV THGPETTQQP DSGTPAATPT
EQKPTRFHGT VMISPERPAR DIHQIVEAII EQLTTLPGAD VSIKLEIDAE VSSGLDRAKV
RTLVENATTL GFIDKAVK