Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_3227 |
Symbol | |
ID | 4898582 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009050 |
Strand | + |
Start bp | 285937 |
End bp | 287664 |
Gene Length | 1728 bp |
Protein Length | 575 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640113826 |
Product | restriction modification system DNA specificity subunit |
Protein accession | YP_001045096 |
Protein GI | 126463983 |
COG category | [V] Defense mechanisms |
COG ID | [COG0732] Restriction endonuclease S subunits |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGCCG ATCAGCTTTT GCGCCTTTAC GATCGGGTTT GCGAGGCCGA GGACGCCGTC CCCCGCCTGC GCCGGCTTGT CCTCGATCTG GCCGTGCGCG GCAAACTGGT GCCGCAAGAC CCGGAAGATG AACCGGCGGC CGAGTTGCTG AAGCGGATCG AGAAGGAAAA GGCCCGGCTG GTGAAGGCGG GGGAGATCCG GAAGCCGAAA TCCATAGAAA GCCCGGATGA CGCCCCTTTT AACCTTCCGG CGAACTGGGC ATGGAGCAAT ATCGCATCTC TTGGTTCCGT CTCGCCTCGT AACGAAGCCG AAGATGACGC AATGGCGTCA TTCGTGCCCA TGACTCTTAT TCCGACCGAA ATCAGGGCGG CAAATGGACA TGAGCCGCGT CATTGGCGTG AGATTAAGAA AGGCTACACT CATTTTGCAG AAGGCGATGT AGGGCTGGCG AAGATAACGC CCTGTTTCGA GAATGGCAAA TCTGCCGTAT TTCGCGGTCT GACAGGTGGC TTCGGTGCGG GAACGACCGA GCTTCACATT GTTCGCCCGA TCTTCGTTTC ACCAGACTAC ATTCTGACAT ACTTGAAAAG CCCGCAATTC ATCGAAAACG GGATTCCCCG GATGACCGGC ACAGCGGGGC AAAAACGCGT TCCGACGGAG TATTTCATCG GCACCCCTTT CCCCCTCCCA CCCCTTGCCG AGCAACACCG CATAGTCGCC AAGGTCGAGG AACTGATGGC GCTGCTCGAC CGGATCGAGG CGGCGCGGGC GGGACGCGAG GAGACGCGCA ACCGCCTGAC CGCCGCAACC CTTGCCCGCC TGACCGACCC CAAGGCCGAC GCCCCCGCCG CCACCCGGTT CGCGCTGGAC ACGCTCGCCC CCCTCACCAC CCGCCCGGAC CAGATCAAGA CCCTGCGCCA AACCATCCTC AACCTTGCCG TTCGAGGCAG GCTCGTCCCG CAAGACCCTG CGGATGAACC GGCGTCGGAG TTGTTGAAAC GGGTATCCGT CGAACGCGCT CGACTTGAGA AAGCAGGAGC CATCCGATCA ACGAAGCGCG CTGCATCCCT TGAAGGTACA AAATTACGGT TCAACCCACC GCCAAGATGG CGCTGGACGA ATCTTGAATG TCTCTTTGCA ATAACAGGGG GAATACAGAA GACACCGGGT AGAATGCCAA AAGCCAATGC GTTTCCGTAT TTGGGCGTCG GAAATGTTTA CCGGAATCGG ATTGATCTAA CTAATCTGAA GAAATTCGAA TTGCAAGACG GTGAAGTTGA TAAGTTCGGC CTGCAACCGT TCGACATTTT GGTTGTTGAA GGCAACGGAA GCGCAACGGA AATCGGACGT TGTGCAATGT GGGAAGGTCA GATTGAGCAA TGCGTTCATC AAAATCACCT GATACGGTGC CGGCCCATTG ATCCGAACCT GTCGCGATAT GCGTTGCTCT ACCTCAATTC CCCGCTTGGA ATGGACGAGA TGACGGAGTT GGCCATCACA TCGGCAGGTC TTTACAACCT CAGCGTAGGC AAAATCTCAA CAGTTCCCCT TCCCCTCCCG CCCCTCGCCG AACAGCACCG CATCGTGGCC AAGGTCGACG CGCTGATGCG CCTCCTCGAC GATCTCGAGG CAGCGCTGAG CGCCTCGTCC ACCACCCGCG CCCGCCTCCT CGACGCCACC CTGCGCGCCG CGCTGGCCGA GCCGGGGCAC ACGAGGGCCG CCGCATGA
|
Protein sequence | MNADQLLRLY DRVCEAEDAV PRLRRLVLDL AVRGKLVPQD PEDEPAAELL KRIEKEKARL VKAGEIRKPK SIESPDDAPF NLPANWAWSN IASLGSVSPR NEAEDDAMAS FVPMTLIPTE IRAANGHEPR HWREIKKGYT HFAEGDVGLA KITPCFENGK SAVFRGLTGG FGAGTTELHI VRPIFVSPDY ILTYLKSPQF IENGIPRMTG TAGQKRVPTE YFIGTPFPLP PLAEQHRIVA KVEELMALLD RIEAARAGRE ETRNRLTAAT LARLTDPKAD APAATRFALD TLAPLTTRPD QIKTLRQTIL NLAVRGRLVP QDPADEPASE LLKRVSVERA RLEKAGAIRS TKRAASLEGT KLRFNPPPRW RWTNLECLFA ITGGIQKTPG RMPKANAFPY LGVGNVYRNR IDLTNLKKFE LQDGEVDKFG LQPFDILVVE GNGSATEIGR CAMWEGQIEQ CVHQNHLIRC RPIDPNLSRY ALLYLNSPLG MDEMTELAIT SAGLYNLSVG KISTVPLPLP PLAEQHRIVA KVDALMRLLD DLEAALSASS TTRARLLDAT LRAALAEPGH TRAAA
|
| |