Gene Information |
Locus tag | Rsph17029_1849 |
Symbol | |
ID | 4897601 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009049 |
Strand | + |
Start bp | 1951458 |
End bp | 1954754 |
Gene Length | 3297 bp |
Protein Length | 1098 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640112441 |
Product | hypothetical protein |
Protein accession | YP_001043725 |
Protein GI | 126462611 |
COG category | [R] General function prediction only |
COG ID | [COG1483] Predicted ATPase (AAA+ superfamily) |
TIGRFAM ID | |
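The length fields in this record are consistent with the coordinates. The check below is a minimal sketch of that arithmetic using the values listed above. It assumes the start and end positions are 1-based and inclusive, and that the CDS includes its terminal stop codon (the listed gene sequence ends in TAG). The function names are illustrative only and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS includes one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the Start bp / End bp fields of this record.
length = gene_length(1951458, 1954754)
print(length, protein_length(length))
```

Under those assumptions the two results match the Gene Length (3297 bp) and Protein Length (1098 aa) fields.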
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGCCAAGT CTACCCGTCA GCATGTGTTC GAGGGCATGG AGCTTCTGCC CGAGGCGTTG ATCCCCTTTG TCGAGAAGCG GCTGGAAAGC TCGCTGCAGG GCCATTGGCA GGTGCAGGTG GTCGAGCGCG TCCGGGGCTT GCGGCCCAAC GGCAACGGCC AGGTGAACTG GGACCAGCAG GGGCTGCTGC AGGCCATGAT GGCGTTCTGG AAGGACGCCT TCGCGATGGT GCTGGGGCAT CCGGAACGGT CCTACGTCTC CGAGCTGCTC GACGTGCGCA ACAAGCTCTC GCACAACGAG GCCTTCACCT ATGACGACGC CGAGCGCGCG CTCGACACGA TGCGGCGGCT GCTGGAATCG GTTAGCGCCA AGGAGACCGC GGAGAAGATC AGCGCCTCGC GCGATACGAT CCTGCGCACG AAATATGCCG AGCTGGCCCG GAACGAGGAG CGGCGCAGGA CGCAGCGCTC GGACATCTCG GTGGACACGG TGGCCGGGCT GATGCCGTGG CGCGAGGTGG TCGAGCCGCA CCAGGACGTG GCCACGGGCG AATTTCAGCA GGCCGAGTTC GCCGCCGACC TGGCCAAGGT GCATAACGGC AGCGCGCCGT CCGAATACCG CAACCCGCGC GAGTTCTTCG CCCGGACCTA TCTGACCGAG GGGCTCAGCA CGCTGCTGGT CGGCGCGGCC AAGCGGCTGG GCTCTGGCGG CGGCGATCCA GTCGTCGAGC TGCAGACGAA CTTCGGCGGC GGCAAGACGC ACTCGATGCT GGCGCTCTAC CACATGGTGA GCGGGACGCC GGTGGAGGAT CTGCCGGGGC TCGACCAGCT TCTGTCGCGG AGCGGGCTGA CGGTGCCGGG CAAGATCAAC CGCGCGGTGC TGGTGGGCAC CTCGCGCGGT CCGCAGGACG TGATCTCGCT TGAGGGCGGC CGGAAGATCC GCACGACCTG GGGCGAGTTG GCGTGGCAGC TGGGCGGCGC CGAGGCCTTC GAAATGCTAG CCGAGAACGA CGAGCGCGGG ATCGCGCCGG GATCGAACCT GCTGGAAGCG CTGTTCAAGA AGTACGCGCC CGCGCTGATC CTGATCGACG AGTGGGTCGC CTACCTGCGA CAGATCTACA AGGTGGAGGG GCTGCCGTCC GGCTCGTTCG ACGCGAACCT GTCCTTTGTG CAGTCCCTGA CCGAGGCCGT GAAGGCGAGC CCCGGCGTGC TGCTGGTGGC GTCGCTGCCC GCCTCGCAGA TCGAGGTCGG CGGCGAAGGC GGACAGGAAG CGTTGGCGCG TCTCAAGCAG ACGTTCAGCC GCGTGGAATC GTCCTGGCGC CCCGCCAGCC AGGAGGAAAG CTACGAGATC GTCCGCCGCC GCCTGTTCAA GGAAATCCCT GGCGACAAGT TCCACCATCG CGACAACACG CTGAAGCAGT TCGCCAAGCT GTACCGGGAG AACGCGAACG ACTTCCCAAA CGGATGCTCC GACGAGGATT ACCGGCGGAA GCTGGAAAAG GCCTACCCAA TCCATCCGGA GCTGTTCGAC CAGCTCTACA CTAGCTGGGG CTCGCTTGAA AAATTTCAGC GCACGCGTGG CGTGCTGCGC CTGATGGCGC AGGTGATCCA CGAACTTTGG ATGGGCAACG ATCCGTCGGT GATGATCATG CCCGGCAGCG TTGCAATCAG CTCGGCGCGC GTCGAGCCCG AGCTGCTGCA CTATCTCGAC TCCAGCTGGC AATCGATCAT CGCGGGCGAC GTCGATGGCG TGACGTCAAC GCCGTACAAG ATTGATCAGT CAGCCCCCAA CCTGAACCGG 
TACTCGGCGA CCCGTCGCGT TGCACGGGCG GTCTTCATGG GAACAGCGCC AACGCACGGT CAGGAGAACA AGGGGCTCGA CGACAAGCAG ATCAACCTTG GCGTCGTCCA GCCCGGTGAA CGTCCGGCGA TCTTTGGCGA CGCCCTGCGC CGCCTCGCAA ACCAGGCTAA GTTCATGCAC AGTGACCTTG GCCGATACTG GTACTCGATG TCGGCCAGCC TCAACAGACT GGCCGCCGAC CGCGCCGCTC AGTTCGAGGA AGCACTCGTC CTCCACGAGA TCGACAAGGC GCTCAGCAGC TACATCAACG GCCTCGCGGA TCGCGGCCAC TTCGACACCG TTCAGGTCGC ACCTGGCAGC TCGGCCGATA TCCCCGACGA GCCCGGCGGT GTGCGCGCGG TGGTTCTGGG CGTGGCACAC CCTCACAGCG GTCGGGAGGG ATCAGAGGCG CTGGCGGAGG CGAAGGATGT CATGATGCAG CGGGGCAGTA CGCCGCGCGT CTACCGGAAC ATGCTGGTCT TCCTCGCCGC CGAGCAACGC CAGCTGGACA ATCTCAAAGC CGCCCAGCGT GCCGCTCTGG CATGGGCCGA GATTGTTCGA GAAACGAAAC GGCTCAACCT GACGCAGAGC GACAGCGCGC TGGCGGAAGC GAAACTGAAA GAGGCAACCG AGACCCTGAA GACACGGATG AAAGAAGCCT GGTGCTATCT GATCTATCCG GTTCAGGAGA GCGCCCAATC CGATGTGGAG TGGATGTCGG CAAAGGTTCC CGCGCAGGAC GGGCTGCTCG CTCGCGCCAG CAAGAAACTT GTGAGCGATC AAGGGATCTG GCCCGAGCTT GGCCCTGACA ACCTGAATCG TCAGCTTGAG AAGTACATCT GGAACGGCAA GCCGCATCTG CATCTCAAGG ATCTCTGGGA GTATCTGAAC CGTTACACCT ACCTTCCGCG CGTCAAGAAC AGGGCGGTTC TCTCGAAGGC GGTGCACGCT GCCGTCAGCG GGATGCTGCC TGGCCCCTTC GCCTATGCAG AGCGCTGGGA CGAGGCGAAG GGGTCCTATG TCGGTCTCGC AATTTCAGGA GCCTCGAACG CTCAGGTCGT GATCGACAGT GAATCGGTCA TCATCAAGCC AGATGTTGCA GAGCAGTACA GGCAGAAGCA GACCGCAGCG GCACCAGCAG AGGGCCCGGC ACCCACCGTA ACACATGGTC CCGAAACGAC TCAGCAACCC GACTCTGGCA CACCGGCAGC CACACCGACC GAGCAAAAGC CCACCCGTTT CCACGGGACG GTGATGATTT CACCCGAGCG GCCGGCGCGC GACATCCACC AGATCGTCGA AGCGATCATC GAGCAGCTGA CTACGCTGCC TGGTGCTGAT GTCTCTATCA AGCTTGAGAT CGACGCAGAG GTGTCTTCTG GCCTTGATCG CGCAAAGGTC AGGACGCTGG TCGAAAATGC GACGACGCTC GGGTTCATCG ACAAAGCTGT CAAATAG
Protein sequence | MAKSTRQHVF EGMELLPEAL IPFVEKRLES SLQGHWQVQV VERVRGLRPN GNGQVNWDQQ GLLQAMMAFW KDAFAMVLGH PERSYVSELL DVRNKLSHNE AFTYDDAERA LDTMRRLLES VSAKETAEKI SASRDTILRT KYAELARNEE RRRTQRSDIS VDTVAGLMPW REVVEPHQDV ATGEFQQAEF AADLAKVHNG SAPSEYRNPR EFFARTYLTE GLSTLLVGAA KRLGSGGGDP VVELQTNFGG GKTHSMLALY HMVSGTPVED LPGLDQLLSR SGLTVPGKIN RAVLVGTSRG PQDVISLEGG RKIRTTWGEL AWQLGGAEAF EMLAENDERG IAPGSNLLEA LFKKYAPALI LIDEWVAYLR QIYKVEGLPS GSFDANLSFV QSLTEAVKAS PGVLLVASLP ASQIEVGGEG GQEALARLKQ TFSRVESSWR PASQEESYEI VRRRLFKEIP GDKFHHRDNT LKQFAKLYRE NANDFPNGCS DEDYRRKLEK AYPIHPELFD QLYTSWGSLE KFQRTRGVLR LMAQVIHELW MGNDPSVMIM PGSVAISSAR VEPELLHYLD SSWQSIIAGD VDGVTSTPYK IDQSAPNLNR YSATRRVARA VFMGTAPTHG QENKGLDDKQ INLGVVQPGE RPAIFGDALR RLANQAKFMH SDLGRYWYSM SASLNRLAAD RAAQFEEALV LHEIDKALSS YINGLADRGH FDTVQVAPGS SADIPDEPGG VRAVVLGVAH PHSGREGSEA LAEAKDVMMQ RGSTPRVYRN MLVFLAAEQR QLDNLKAAQR AALAWAEIVR ETKRLNLTQS DSALAEAKLK EATETLKTRM KEAWCYLIYP VQESAQSDVE WMSAKVPAQD GLLARASKKL VSDQGIWPEL GPDNLNRQLE KYIWNGKPHL HLKDLWEYLN RYTYLPRVKN RAVLSKAVHA AVSGMLPGPF AYAERWDEAK GSYVGLAISG ASNAQVVIDS ESVIIKPDVA EQYRQKQTAA APAEGPAPTV THGPETTQQP DSGTPAATPT EQKPTRFHGT VMISPERPAR DIHQIVEAII EQLTTLPGAD VSIKLEIDAE VSSGLDRAKV RTLVENATTL GFIDKAVK
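The sequences above are displayed in space-separated blocks of 10 characters, so they need to be joined before any downstream use. The following is a minimal sketch, assuming plain-string input copied from this record. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the short example string is only the first two blocks of the gene sequence, so its GC fraction is not expected to equal the whole-gene GC content field.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (after removing whitespace)."""
    dna = clean_sequence(dna)
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First two 10-base blocks of the gene sequence listed above.
example = "ATGGCCAAGT CTACCCGTCA"
print(len(clean_sequence(example)), round(gc_fraction(example), 2))
```

Applying `gc_fraction` to the full gene sequence would be the way to compare against the GC content field of this record.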