Gene Rsph17029_1858 details


Gene Information

Locus tag: Rsph17029_1858
Symbol:
ID: 4897276
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009049
Strand:
Start bp: 1962148
End bp: 1963518
Gene Length: 1371 bp
Protein Length: 456 aa
Translation table: 11
GC content: 59%
IMG OID: 640112450
Product: restriction modification system DNA specificity subunit
Protein accession: YP_001043734
Protein GI: 126462620
COG category: [V] Defense mechanisms
COG ID: [COG0732] Restriction endonuclease S subunits
TIGRFAM ID:
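
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below is only an illustration and rests on two assumptions that the record does not state explicitly: that Start bp and End bp are 1-based and inclusive, and that the gene length includes one terminal stop codon. The function names are hypothetical.

```python
# Values copied from the record above.
START_BP = 1962148
END_BP = 1963518


def gene_length_bp(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(gene_len: int) -> int:
    """Residue count, assuming the gene length includes one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length_bp(START_BP, END_BP)
print(length, protein_length_aa(length))  # 1371 456
```

Under these assumptions the computed values match the Gene Length (1371 bp) and Protein Length (456 aa) fields.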


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGGCGCT ATCCGGCCTA CAAGGACAGC GGCGTGGAGT GGCTGGGGGA GGTGCCGGAG 
GGATGGGAGG TAAAATGCCT CAGGATGATT GCCGATGAGT TGCAGACTGG CCCATTCGGG
AGTCAGCTTC ACACGGAAGA CTACGTCACT GCGGGCGTTC CGATTGTTAA TCCCTCGAAC
ATATTGGACG GCCAAATCGT TCCTGACGAT GAAATCGGTG TAGATGAGGC AACAGCATTA
CGACTTGCCA ATCATGCGTT GTTGCCGGGC GACATTATTC TTGGCCGACG CGGAGAACTC
GGCAGATGCG CAGTCGTGCC AGACGGAACA ATGCCCCTCC TATGCGGCAC TGGTTCCCTT
CGGATCAGGC TGAAAAGCAG TCAGGCGCTT CCCGATTTTA TAGCGGAGTG CATCCGCACG
CCTCGTGTTC GCGAATGGCT ATCTCTGCAA AGCGTCGGGT CCACCATGGA CAACCTGAAC
ACCGCCATCG TCGGAAAGAT ACAGATTGCG CTCCCCTCTC TGCCCGAGCA GCGAGCCATC
ACCGCCTTCC TCAACCGGGA GACGGCGAAG ATCGACGCGC TGGTGGAGGA ACAGCGGCGG
CTGATCGCGC TGTTGGCCGA GAAGCGTCAG GCCGTCCTCA ACCACGCCGT CACCCGCGGC
CTGAACCCCG ACGCCCTCCT CAAACCCTCG GGCATCGATT GGCTCGGGGA TATTCCCGAG
GGTTGGGAGG TGGTGCCTAT CCGAAAGGTG GCGCGTCTGG AATCGGGCCA TACGCCAAGC
AGATCCCGCC CGGAGTGGTG GGTGGACTGC CATATCCCTT GGTTTTCGCT TGCCGATATT
TGGCAGGTCC GTCCGGGGCG GGTCGAATAC GTGTACGAAA CAGCCGAAGC GGTTTCGGAA
CTCGGGCTAC AAAACTCTTC GGCACGTTTA CTTCCAGCGG GCACGGTCAT GCTTTCCCGG
ACCGCATCGG TCGGCTTCTC TGCAGTCATG GGCATTGCAA TGGCGACGAC TCAGGATTTT
GCGAATTGGG TTTGCGGCTG CCGCCTGCTT CCGGATTACC TTCTTTACTG TCTTCGGGGG
ATGCCCAGCG AGTTCGAGCG GTTAAAGATG GGGTCAACCC ACAACACTAT CTACATGCCT
GATATCCGGA CGTTGACGAT TCCCTTGCCG CCTCTGGAGG AACAGAAGGC CATCGTGGAT
CATGTCCGCG CGAGCGTAGG AGCGTTGGAC GAGTTGATGG ACACCGCAAC CACCGCCATC
ACCCTCCTTC AGGAACGTCG TGCCGCGTTG ATCTCGGCGG CGGTGACCGG CAAGATCGAC
GTCCGAGACC TTTCTCCGCA ATCCCTTTCC GACTGCCTGG AACCCGCGTG A
 
Protein sequence
MRRYPAYKDS GVEWLGEVPE GWEVKCLRMI ADELQTGPFG SQLHTEDYVT AGVPIVNPSN 
ILDGQIVPDD EIGVDEATAL RLANHALLPG DIILGRRGEL GRCAVVPDGT MPLLCGTGSL
RIRLKSSQAL PDFIAECIRT PRVREWLSLQ SVGSTMDNLN TAIVGKIQIA LPSLPEQRAI
TAFLNRETAK IDALVEEQRR LIALLAEKRQ AVLNHAVTRG LNPDALLKPS GIDWLGDIPE
GWEVVPIRKV ARLESGHTPS RSRPEWWVDC HIPWFSLADI WQVRPGRVEY VYETAEAVSE
LGLQNSSARL LPAGTVMLSR TASVGFSAVM GIAMATTQDF ANWVCGCRLL PDYLLYCLRG
MPSEFERLKM GSTHNTIYMP DIRTLTIPLP PLEEQKAIVD HVRASVGALD ELMDTATTAI
TLLQERRAAL ISAAVTGKID VRDLSPQSLS DCLEPA
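
The gene and protein sequences above are related by translation under the genetic code named in the record (translation table 11, whose codon-to-amino-acid assignments are the same as the standard code). The following is a minimal sketch of that relationship, not the database's own pipeline: the helper names are hypothetical, the 10-letter blocks are assumed to be display formatting only, and the GC helper reports a plain percentage whose rounding may differ from how the listed 59% was derived.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display, and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence shown above.
print(translate("ATGAGGCGCT ATCCGGCCTA CAAGGACAGC"))  # MRRYPAYKDS
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (after applying `clean` to both) is one way to check the record for copy errors.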