Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_1845 |
Symbol | |
ID | 4897108 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009049 |
Strand | + |
Start bp | 1942589 |
End bp | 1944514 |
Gene Length | 1926 bp |
Protein Length | 641 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640112437 |
Product | hypothetical protein |
Protein accession | YP_001043721 |
Protein GI | 126462607 |
COG category | [S] Function unknown |
COG ID | [COG3472] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGGGTAAGC AAGCCGCATT CAAGACGAAC CCGGTCAGCC TGGAAGAACT CCTGCGCCAG TGCGGGAACG GCAAGATCCA GTTGCCCGAC TTCCAGCGTA GCTGGGTCTG GGACGAGGAA CGCATCAAGG GTCTGATCGC GTCGATCTCG CAGGCCTTTC CGGTCGGCGC TCTGATGACC CTTGAGGTCA AGCCGGGCGC AGCGGACACA TTCGCCCGTC GACCGATCCA AGGTGCCGAC GCGGCTGTTG GCGATAGCGC ACCGGATCAG CTCCTGCTGG ATGGCCAGCA GCGCATGACT TCTCTCTACC AGACTTGTTT GCGCCGCGAG GTGGTGCACA CGGTGACACC GCGTCTCAAG CTCGTGAAGC GCTGGTTTTA CATCGACATG CGCAAATCAA TGGACCCTTC CGAGGACCGG GAGAACGCAA TTGTATCCGT TCCAGAGGAT CGGCGGATCA AATCCGACTT CGACCGCAAG GTCGAACTAG ACCTTTCGAC TGCGGAGCTC GAATACGAAA ATCTGATGTT CCCGCTCAAC CAAGTCTTCG ACTCGATGAA CTGGATGATG GGATTCTGGA CCTACTGGAC CCAAAAGGGC GAGTTGGAAA AGACAGAGTT CTTCAAGGCG TTCAACGAAA GCGTCCTTCA GAACTTCAAA TCTTACCAGC TGCCCGTGAT CGCACTGAGC CCCGACACCT CGCACGAGGC CGTCTGTCTC GTCTTCGAAA AGGTGAACAC TGGTGGCAAG CCACTCGACG CGTTCGAACT GGTGACAGCA ATGTATGCGG CCCGCGGCCA TCGCTTGCGC GACGACTGGC TCGGCGCCGA CGGCAAGCCG GGACTGCAGA CCAGGCTCCA GCTCTACGGC CGCGCAGCTG AGCAGAAGTT CGGTGTCCTC GAGAAGGTCG CCGCGACCGA TGTGCTTCAG GCCATTGCGC TCCTGCACGG TGTCGAAAAG CGCGCGGCTG AAATCGCTGC TGGGCGCAAG GAGTCGGAGT TGTCAGCCGT CCGCGCCACG CGCCAGTCGC TGCTCGATCT GCCACTCGAA GCCTACCTGA AACATCGCGG CGCGGTCGAG GAAGGGTTCA AGACAGCGGC ACGTTTCCTC CGTCAGAACC ATATCTACCG GGTCCTCGAC CTACCCTATC AGGGACAGCT TGTCCCCTTC GCCGCGATCC TCGCGATGAT CGGACCCAAG TTCGATCACG CGGCAGTGCG TGACCAACTC GCCCGCTGGT TCTGGTGCGG AATCTTCGGC GAACTCTACG GCTCGGCCAT CGAGTCTCGT TTTGCAAAGG ATGTGCTGGA GGTTCCTGCA TGGCTCGACG GCGGGCCAGA ACCGAGCACG ATCACCGACG GGCGCTTCCG CCCCGAGAGG CTACGTACCC TGCGCACGCG CCTCTCCGCC GCCTACAAGG GTATCCACGC CCTGCTGATG GCTGAAGGCG CGCGGGATCT CCGCTCGGGC CAGCATTTCA AGGACACCGT GTTCTTCGAC GAATATGTCG ATATCCATCA CATCTTCCCG CAGGACTGGT GCAAGAAGCA GAAAATCGAG CCCAAGGTCT TCGACACCGT CATCAACAAG ACCCCGCTCA GCTACAAGAC GAACCGAATT CTCGGGGGCG TCGCGCCATC GGTCTACCTG GAGCGGCTCG AGACGGGCGG GAAGGACACG CCACCCATCT CTCGCGACGC ACTCGACGAA TACCTCGCCT CGCACGCCAT GGATCCCACC CTCCTGCGCG CCGACGATTT CACGGGCTTC ATGGCGGATC GCGAGACGCG GCTGTTGGCC ATGATCTCGC GCGCTACTGG CCACGCTATC ACTCGTTCCG ACGCCGCGCC CGAGGAGGGC GAAGACGTCC CGCAGGATGA CGAGGGATTC GACTTGCCAG ATCCCGATGC AGAGGAGGCC GCCTGA
|
Protein sequence | MGKQAAFKTN PVSLEELLRQ CGNGKIQLPD FQRSWVWDEE RIKGLIASIS QAFPVGALMT LEVKPGAADT FARRPIQGAD AAVGDSAPDQ LLLDGQQRMT SLYQTCLRRE VVHTVTPRLK LVKRWFYIDM RKSMDPSEDR ENAIVSVPED RRIKSDFDRK VELDLSTAEL EYENLMFPLN QVFDSMNWMM GFWTYWTQKG ELEKTEFFKA FNESVLQNFK SYQLPVIALS PDTSHEAVCL VFEKVNTGGK PLDAFELVTA MYAARGHRLR DDWLGADGKP GLQTRLQLYG RAAEQKFGVL EKVAATDVLQ AIALLHGVEK RAAEIAAGRK ESELSAVRAT RQSLLDLPLE AYLKHRGAVE EGFKTAARFL RQNHIYRVLD LPYQGQLVPF AAILAMIGPK FDHAAVRDQL ARWFWCGIFG ELYGSAIESR FAKDVLEVPA WLDGGPEPST ITDGRFRPER LRTLRTRLSA AYKGIHALLM AEGARDLRSG QHFKDTVFFD EYVDIHHIFP QDWCKKQKIE PKVFDTVINK TPLSYKTNRI LGGVAPSVYL ERLETGGKDT PPISRDALDE YLASHAMDPT LLRADDFTGF MADRETRLLA MISRATGHAI TRSDAAPEEG EDVPQDDEGF DLPDPDAEEA A
|
| |