Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_1848 |
Symbol | |
ID | 4897233 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009049 |
Strand | + |
Start bp | 1948540 |
End bp | 1951446 |
Gene Length | 2907 bp |
Protein Length | 968 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640112440 |
Product | hypothetical protein |
Protein accession | YP_001043724 |
Protein GI | 126462610 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1743] Adenine-specific DNA methylase containing a Zn-ribbon |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAGA AGCTCATCGA AGTAGCCATT CCACTTGAGG CGATTAATGC GGCGTCGGCG CGAGAAAAAT CGATCCGACA TGGGCATCCT TCCACCCTGC ACCTTTGGTG GGCGCGGCGG CCGCTCGCGG CCTGTCGGGC GGTGTTGTTC GCGCAGCTGG TGGACGATCC CTCGTCGCGG GTGGACGAGC TGATGGCCGA CCCGAAGCTG CGGGCGCAGG CCGAGGTGGA GCTGCCCGCG CGGCTGGCGG CCTGGGAGAA GAGCAAGGCG GCTGCACAGG GCGCAGCGGC GAATGCGCCA GAGCCGACGC TGGAGGATGT GGCGGTCGAG ATTGAGCGCA GGCGGCTGTT TGCCATCATC GAGGACTTGG TGAAGTGGGA GAACTCGACC AACGAGGAGG TGCTGGAACG CGCGCGGGCC GAGATCCGGC GCAGCTGCGG CGGCGTGTTG CCCCCTGTCT ATGACCCGTT CTCGGGCGGC GGGTCGATCC CGCTCGAGGC GCAGCGGTTG GGCCTACCCG CCTATGGGTC TGACCTGAAC CCGGTGGCGG TGATGATCGG AAAGGCGATG ATCGAGATCC CGCCGAAATT CAAGAACATG CCGCCCATCC ACCCCGGCAT CAAAGAGCGG TCGTTCTACC GCAACGCCGA AGGGCTGGCC GAAGACGTGA AATATTACGG CGAATGGATG CGGGAGAAGG CATGGGATCG CATCGGGCAT CTCTACCCGC AGGTGGACTT GCCGAAGGAG TACGGCGGCG GCAAGGCGAC GGTGATCGCC TGGATCTGGG CGCGGACGGT GCCGAGCCCG GACCCGGCCT TTTCTGAAGT GCAGGTACCG ATCGCGTCCA GTTTCCTGCT AAGCGCGAAA CCGGGAAAAG AAGCTTGGGT CGAGCCGACT GTCGAACGAG CAACCAAGAG AATCACATAC CGGATACGCC AAAAGGGCAC AAAGGCTGAA ATTGCAGAGG CGAAGAATGG CACAAAGGCG GGACGTGGCG CGAACTTCCG TTGCCTGATT TCGGACACCG CCATTACACC CGACTATGTC AAGCGCATGG GCCGCGATGG GCGGATGGGC CAGACGCTTC TAGCTATCGT GGCTGAGGGA AAAGGTGGGC GTGCTTACGT AGCGCCAACT AACGACCAGG TCGCCACCGC CATCTCGGCA GAACCCACTT GGCGACCTGA GGCCGCGTTG CCGAATGATC CACGGAATTT CTGGACAGTA GATTACGGAC TCACAAATTT TGGCGATCTA TTCACAAGCC GCCAACTTGT CGCGCTGAAT ACATTAAGCG AACTGGTACA TGAAGTCCGT GCAAAGATAG AAAGCGATGC AGCTTCTTCT GGCCTTATCG CTGATGGGAT GCCACTTCGG GATGGGGGCA AAGGAGCCCT TGCATATGCC GAAGCCGTAA GTATCTATCT CGGCTTCCTT ATTGGACAGG TCGCCAACCA TTGTTCAACC ATCTGTGGAT GGAACAGTCC TAACCAGCAG ATGCGATCCA CTTTTTCGCG GCAATCCCTA CCAATGACGT GGGATTTTGC CGAGGTAAAC GTATTTAGCG AATCCAGCGG AAGCTACCAC AGCCTCTTCA CAAGGATGGT GAAGGGATTT GAAGTGCTTG GGGCAAGCGA TGAGAAATCT GCGATCACGC AAAGTGATGC GCAGGGCGTC CAGTACCCCC TAGACACTGC GATCTCTACG GATCCGCCGT ACTACGACAA TATTGGCTAC GCAGACTTGT CTGATTTCTT CTTCTGCTGG ATGAAGCCAG CATTGAGGGC AATATATCCC GATTTGTTTT CTCTGATAAC AACACCAAAG GCGGAAGAAT TGGTGGCAAC CCCTTATCGT CATGGAGGAA AGGATGCCGC AGAAGCCTTT TTTCTTGATG GAATGAGTCG GGCCATCGCA CGAATGGCGG AGGCTGGAAG TGGTGCCTTT CCAGCCACAA TCTACTACGC ATTTAAGCAG AGTGAGATTG AACAAGAGGG TATCAGCTCA ACAGGCTGGG CGACATTCGT TCAATCGGTC ATGGACGCCG GCTATTCTGT GGTGGGAACC TGGCCTCTCC GTACAGAAAA GCCGGGACGG ATGATTGCGG TTGGGACAAA CGCGCTCGCA AATTCTGTCG TGCTCGTCTG CCGCAAGAAG GATGCCAAGG CTGACACGAT CACTCGCGCC GAGTTCATTC GCGCTCTGAA ACGCGAACTG CCCCCGGCCA TCGCCGAGCT TCAAGCCGCC AGCATCGCTC CGGCCGACAT GCCGCAGTCG GCTATCGGCC CCGGCATGGG CGTCTTCTCG CGCTACCGCG CCGTGCTCGA GGCCGACGAC AGTGCGATGA CGGTCAAGAC CGCGCTGCAG CTGATCAATG CCGAGCTCGA CGAATATCTC GGCGGCATCC AGGGCGAGTT CGACGCCGAT ACTCGCTTCG CCATCACCTG GTTCGAACAG AACGGCATGG GCAAGGGAGA CTTCGGCGCC GCCGACAGCC TCGCCCGCGC CCGCGGCATC GCAGTCGACA GCGTGAAGCA TGCCGGGATC GTCGAAAGCG CGGCGGGAAA GGTGCGCCTG TTGAAGCGCG ACGAGCTCGA TCCCGATTGG GCGCCCGAGG AGGACGGACA TCTGACCGTC TGGGAATGCC TGCAGCACCT CGTGCGCCTG CACGAAAAGG AGGGCCTGTC TCACGACACT GCGGCGCTGC TGAAACGCTT CGGGCCCCAG GCCGAGGCAG TGAAGGATCT GGCCTACTGC CTCTACGACA TCGCCGCCAA CAAGCGGCGC GAGGCCTCCG AGGCCACGGT CTACAACGCC CTGATCGCCG ACTGGTCAGA GCTGAGCCAG ATGGCCGCCA CGGTTTCGCT TGAAGGGCGG AACCGGCAAA CGCGATTTGA ACTGTAA
|
Protein sequence | MKKKLIEVAI PLEAINAASA REKSIRHGHP STLHLWWARR PLAACRAVLF AQLVDDPSSR VDELMADPKL RAQAEVELPA RLAAWEKSKA AAQGAAANAP EPTLEDVAVE IERRRLFAII EDLVKWENST NEEVLERARA EIRRSCGGVL PPVYDPFSGG GSIPLEAQRL GLPAYGSDLN PVAVMIGKAM IEIPPKFKNM PPIHPGIKER SFYRNAEGLA EDVKYYGEWM REKAWDRIGH LYPQVDLPKE YGGGKATVIA WIWARTVPSP DPAFSEVQVP IASSFLLSAK PGKEAWVEPT VERATKRITY RIRQKGTKAE IAEAKNGTKA GRGANFRCLI SDTAITPDYV KRMGRDGRMG QTLLAIVAEG KGGRAYVAPT NDQVATAISA EPTWRPEAAL PNDPRNFWTV DYGLTNFGDL FTSRQLVALN TLSELVHEVR AKIESDAASS GLIADGMPLR DGGKGALAYA EAVSIYLGFL IGQVANHCST ICGWNSPNQQ MRSTFSRQSL PMTWDFAEVN VFSESSGSYH SLFTRMVKGF EVLGASDEKS AITQSDAQGV QYPLDTAIST DPPYYDNIGY ADLSDFFFCW MKPALRAIYP DLFSLITTPK AEELVATPYR HGGKDAAEAF FLDGMSRAIA RMAEAGSGAF PATIYYAFKQ SEIEQEGISS TGWATFVQSV MDAGYSVVGT WPLRTEKPGR MIAVGTNALA NSVVLVCRKK DAKADTITRA EFIRALKREL PPAIAELQAA SIAPADMPQS AIGPGMGVFS RYRAVLEADD SAMTVKTALQ LINAELDEYL GGIQGEFDAD TRFAITWFEQ NGMGKGDFGA ADSLARARGI AVDSVKHAGI VESAAGKVRL LKRDELDPDW APEEDGHLTV WECLQHLVRL HEKEGLSHDT AALLKRFGPQ AEAVKDLAYC LYDIAANKRR EASEATVYNA LIADWSELSQ MAATVSLEGR NRQTRFEL
|
| |