Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_3225 |
Symbol | |
ID | 4898580 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009050 |
Strand | + |
Start bp | 282043 |
End bp | 284490 |
Gene Length | 2448 bp |
Protein Length | 815 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640113824 |
Product | EcoEI R domain-containing protein |
Protein accession | YP_001045094 |
Protein GI | 126463981 |
COG category | [V] Defense mechanisms |
COG ID | [COG4096] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.376811 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATAGCG TTGGCCTTTC CGAACGAGAC ATCTGCTCCC AGCGGATCAC GCCCGCCATT ATTCAGGCCG GCTGGGACCT CTCAACCCAG ATCAGGGAGG AGGTCAGTTT CACGAAGGGC CGCATCATCG TCCGCGGCAG GCTGGTCAGC AGGGGCAAGG GAAAGCGTGC CGACTACATT CTGTCCGTGC GCCCGAACAT CCGCCTCGCG GTCGTCGAGG CGAAGGACAA CGGTCATGCT GTTGGCGCGG GGATACAGCA AGCGCTTGAG TACGCCGAGA CGCTGAACCT ACCCTTCGCG TTCTCCTCGA ACGGTGAGGG CTTCGTCTTC CACGACCGGA CCGGCCTGAG CCCAACGCCG GAGCGGCTTC TGACACTCGA TGAATTTCCG TCGCCTGCCG AACTCTGGCA ACGGTACTGC GAATGGAAGG GTCTGGATGC CGGGGCGAGC CACGTCACCC TGCAGGACTA TCATGACGAC GGCAGCGGCA AGGAACCGCG GTACTACCAG ATCAACGCCG TGAACGCCGC AATCGAAGCA ATCGCGCGCG GCGATCGCCG CGTGCTGCTC GTCATGGCCA CCGGCACCGG CAAAACCTAC ACCGCATTCC AGATCATCTG GCGCTACCGG CAAGCCTTCC CCGGCAAGCG CGTCTTGTTC CTCGCCGATC GCAATGTCCT GATCGACCAG ACGATGGTCA ACGATTTCCG GCCATTCTCG GGCAAGATGG CCAAGCTCTC TACTCAGGCC AAGACCATCG AGCGCGCCGA CGGGAGCACC GTCGACCTTC CGCTCGGGCT CGACCGCAAG CGGCGGATCG ACCCGTCCTA CGAGATCTAC CTGGGCCTCT ATCAGGCGAT CACCGGCCCT GCGGAAGAGG ACAAGATCTT CCGTGAGTTC TCCAGGGATT TCTTCGACCT CATCATCATC GACGAATGCC ACCGCGGCAG CGCGGCCGAG GACTCGGCCT GGCGGGAGAT CCTCGACCAT TTCTCGGGCG CGGTGCAGAT CGGTATGACG GCCACTCCGA AGGAGACCGA ATACGCCTCG AACATCGCCT ATTTTGGCGC ACCGGTATAC AGCTACACGC TGAGGCAGGG CATTCGCGAC GGCTTCCTGG CGCCGTACAA GGTGATCAAG GTCCACATCG ACCGTGACGT TCAGGGCTAC CGCCCCGAGG CTGGGCAACT CGACCGGGAC GGGCAGGAGG TCGACGACCG GATCTACAAC ATCAAGGACT TCGACCGCAC ACTGGTGCTC GACGACCGGA CCGTCCTCGT GGCGAGAAAG GTCACCGAGT TCCTGAAGGA AAGCGGCGAT CGGATGGCGA AGACCATCGT CTTCTGCGTC GATCAGGAAC ACGCGGCCCG GATGCGCCAA GCCCTGATCA ACGAGAACGC CGACCTGGTA GCGCAGAACT CCCGCTACGT CATGCGCATC ACCGGCAACG ACAAGGAAGG ACTCGACCAG CTCGGGAACT TCATCGATCC GGAAGTGGCC TATCCTGTCA TCGTCACCAC CTCGCGGCTG CTCTCAACCG GTGTGGATGC CCAGACCTGT CGCCTTATCG TGCTCGATCG CGAGGTCGGC TCGATGACCG AGTTCAAGCA AATCGTCGGT CGCGGCACAC GGGTCCACGA GGACACCGGC AAGTTCTACT TCACGCTGAT GGATTTCCGG GGCGCATCGA ACCATTTCGC GGACCCCGCC TTTGATGGCG AACCGGTTCA GATCTACGCT CCGACTGATA CCGACCCGAT CACGCCACCG GAGGACGTGC CGCCAGCGGG CGACGAGGAC GATCCGATCC CGCCCGTTCC CGGGGCGGAC GAGACTGTCG TCCAGGATCC CGATTGGACG GACGCCGGTC CCCGGGAGCC CCTCCGCAAG ATCTACGTCG ACGGGGTTGG CGCCCTGATC ATCGCCGAAC GCGTCGAATA CCTCGATGAA CACGGCAAGC TGATCACCGA GTCTTTGCGC GACTACACCC GCAAGGCACT GAAACGGCGC TTCGCCAGCC TCGACGACTT CCTGAAGCGC TGGAAGACAC AGGAGAAGAA GCAGGCCGTT GTCGAGGAAC TCGAAGCCGA GGGCCTGCCG CTCGACGTGA TCGCGCAGGA ACTGGGGCGG GATCTCGATC CCTTCGACCT GATCTGCCAC ATCGCCTTTG ACGCCAAGCC ATTGACTCGG CGCGAGCGTG CCGACGGCGT GAGGAAGCGC GATGTCTGGG GCCGGTATGG CGACACCGCC CGTGCCGTCC TCAACGCCCT GCTCGACAAG TACGCCGATG ACGGCCTTCT CGACTTCGAC GACCCCAAGA TCCTGAAAAT CAACCCGTTC TCCCAGATGG GAACCGAGGT TGAACTCATC AGGGCTTTCG GCAAGAAGCA GGACTATCTG AAGGCCATGC ATGACTTGCA GGTCGCCCTC TACGAAGAAA GTGCCTGA
|
Protein sequence | MDSVGLSERD ICSQRITPAI IQAGWDLSTQ IREEVSFTKG RIIVRGRLVS RGKGKRADYI LSVRPNIRLA VVEAKDNGHA VGAGIQQALE YAETLNLPFA FSSNGEGFVF HDRTGLSPTP ERLLTLDEFP SPAELWQRYC EWKGLDAGAS HVTLQDYHDD GSGKEPRYYQ INAVNAAIEA IARGDRRVLL VMATGTGKTY TAFQIIWRYR QAFPGKRVLF LADRNVLIDQ TMVNDFRPFS GKMAKLSTQA KTIERADGST VDLPLGLDRK RRIDPSYEIY LGLYQAITGP AEEDKIFREF SRDFFDLIII DECHRGSAAE DSAWREILDH FSGAVQIGMT ATPKETEYAS NIAYFGAPVY SYTLRQGIRD GFLAPYKVIK VHIDRDVQGY RPEAGQLDRD GQEVDDRIYN IKDFDRTLVL DDRTVLVARK VTEFLKESGD RMAKTIVFCV DQEHAARMRQ ALINENADLV AQNSRYVMRI TGNDKEGLDQ LGNFIDPEVA YPVIVTTSRL LSTGVDAQTC RLIVLDREVG SMTEFKQIVG RGTRVHEDTG KFYFTLMDFR GASNHFADPA FDGEPVQIYA PTDTDPITPP EDVPPAGDED DPIPPVPGAD ETVVQDPDWT DAGPREPLRK IYVDGVGALI IAERVEYLDE HGKLITESLR DYTRKALKRR FASLDDFLKR WKTQEKKQAV VEELEAEGLP LDVIAQELGR DLDPFDLICH IAFDAKPLTR RERADGVRKR DVWGRYGDTA RAVLNALLDK YADDGLLDFD DPKILKINPF SQMGTEVELI RAFGKKQDYL KAMHDLQVAL YEESA
|
| |