Gene Rsph17025_3534 details


Gene Information

Locus tag: Rsph17025_3534
Symbol:
ID: 5085686
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17025
Kingdom: Bacteria
Replicon accession: NC_009429
Strand:
Start bp: 422546
End bp: 423991
Gene Length: 1446 bp
Protein Length: 481 aa
Translation table: 11
GC content: 65%
IMG OID: 640485093
Product: EcoEI R domain-containing protein
Protein accession: YP_001169709
Protein GI: 146279551
COG category: [V] Defense mechanisms
COG ID: [COG0286] Type I restriction-modification system methyltransferase subunit
TIGRFAM ID:
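
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    length = gene_length_bp(422546, 423991)
    print(length, protein_length_aa(length))
```

With the values from this record, the two functions reproduce the listed 1446 bp and 481 aa.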


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0122101
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCGTCC GCACCCTCGT CAAATCCGTT CAGGACATCA TGCGCAAGGA TACCGGGGTC 
GACGGCGACG CTCAGCGTAT CAGCCAGCTG TGCTGGATGT TCTTCCTCAA GATCATCGAC
GATCAGGACG AGGCCCTCGA ACTCACCCGC GATGAGTACA TCTCACCGAT CCCGGCCGAT
CTGCAATGGC GGGCCTGGGC GGCCGACCCT GAAGGCATGA CGGGTGACGA GCTTCTGTCC
TTCGTCAACG AGTCCCTGTT TCCCCGCCTG AAGAACCTCC GCCCGACCGC GCCGCGCGCC
CGCGTCATCC GCGACGTGTT CGAGGACGCG TACAACTTCA TGAAGTCCGG CCAGCTGCTG
CGGCAGGTGA TCAACAAGAT CAACGGCGTC GACTTCAACA ACCTCACCGA GCGCCAGCAT
TTCGGCGACA TCTACGAACA GCTGCTGAAC GACCTGCAGA ACGCCGGCAA CGCGGGCGAA
TACTACGACC CTCGCGCCGT CACCGCCTTC ATGGTGCAGC AGATCGACCC GCGACCGGGC
GAGATCCTGA TGGACCCCGC CTGCGGCACC GGTGGCTTCC TCACCTGCGC CATGCGCCAC
ATGCGCGATC GGCACATCCG CCTGCCCGAG CATGAGGACC TGATGCAGCG CTCCCTGCGG
GCGGTCGAGA AGAAGCCACT GCCGCACATG CTCTGCGTCA CCAACATGCT GCTGAACGGG
GTCGAGGAGC CGCATTTCGT CCGCCACGAC AATACGCTCG CCCGCCCGCT CACCAGCTGG
AGCAGGGACG AACGCGTCGA CATCGTGCTG ACCAACCCGC CATTTGGCGG CAAGGAGGAG
GACGGGATCG AGAACAACTT CCCGACCTTC CGCACCCGCG AAACCGCCGA TCTGTTCCTG
GCCCTGATCA TCCGCCTGCT CAAGCCCGGC GGCCGCGCCG CCGTGGTGCT GCCCGATGGA
TCGCTCTTCG GTGAGGGCAT CAAGACCCGG CTAAAAGAGC ACCTGATGGC GGAATGCAAC
CTGCACACCA TCGTGCGGCT GCCGAACTCG GTCTTCAAGC CCTATGCCTC GATCGGCACC
AACCTGCTTT TCTTCGAAAA GGGCAGCCCC ACCTCCGAGA CATGGTTCTG GGAGCATCGC
GTGCCCGAGG GGCAAAAGGC CTATTCGATG ACGAAGCCGA TCCGGCTCGA GCATCTGCAG
CGCTGTGCGG ACTGGTGGGG CGGCGCGAAC CGCGAGGGGC GCGTCGAAGG CCCGGAGGCG
TGGAAAGTCT CGGCCGAGGA AATCAAGGGG CGCGGCTACA ACCTCGACAT CAAGAACCCC
CACGCCGAGG CCGAGGATCT GGGCGACCCC GAGCATCTTC TGGCCGCGCT CGACGGCGCC
GAGGCCGAGG TCACCCGCCT GCGCGAGGCG TTGAAAGCGA TTCTGACCGA GGCGCTGGCG
CGATGA
 
Protein sequence
MSVRTLVKSV QDIMRKDTGV DGDAQRISQL CWMFFLKIID DQDEALELTR DEYISPIPAD 
LQWRAWAADP EGMTGDELLS FVNESLFPRL KNLRPTAPRA RVIRDVFEDA YNFMKSGQLL
RQVINKINGV DFNNLTERQH FGDIYEQLLN DLQNAGNAGE YYDPRAVTAF MVQQIDPRPG
EILMDPACGT GGFLTCAMRH MRDRHIRLPE HEDLMQRSLR AVEKKPLPHM LCVTNMLLNG
VEEPHFVRHD NTLARPLTSW SRDERVDIVL TNPPFGGKEE DGIENNFPTF RTRETADLFL
ALIIRLLKPG GRAAVVLPDG SLFGEGIKTR LKEHLMAECN LHTIVRLPNS VFKPYASIGT
NLLFFEKGSP TSETWFWEHR VPEGQKAYSM TKPIRLEHLQ RCADWWGGAN REGRVEGPEA
WKVSAEEIKG RGYNLDIKNP HAEAEDLGDP EHLLAALDGA EAEVTRLREA LKAILTEALA
R
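
The gene and protein sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how such a listing might be processed, the code below strips the whitespace, computes the GC fraction, and translates the DNA using the standard amino-acid assignments that translation table 11 shares with table 1. All function and variable names are hypothetical. The sketch also assumes the reading frame starts at the first base and does not handle the alternative start codons that table 11 permits (this gene begins with ATG, so that case does not arise here).

```python
from itertools import product

# NCBI codon ordering: bases in the order T, C, A, G at each position.
BASES = "TCAG"
# Amino acids for translation table 11 (identical to table 1 apart from start codons).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First three blocks of the gene sequence listed above.
    print(translate("ATGTCCGTCC GCACCCTCGT CAAATCCGTT"))
```

Passing the full gene sequence to `translate` and comparing the result with the cleaned protein sequence is one way to confirm that the two listings agree.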