Gene Rsph17029_3226 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_3226 
Symbol 
ID4898581 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009050 
Strand
Start bp284495 
End bp285940 
Gene Length1446 bp 
Protein Length481 aa 
Translation table11 
GC content65% 
IMG OID640113825 
ProductN-6 DNA methylase 
Protein accessionYP_001045095 
Protein GI126463982 
COG category[V] Defense mechanisms 
COG ID[COG0286] Type I restriction-modification system methyltransferase subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGTCC GCACCCTCGT CAAATCCGTT CAGGACATCA TGCGCAAGGA TACCGGGGTC 
GACGGCGACG CTCAGCGTAT CAGCCAGCTG TGCTGGATGT TCTTCCTCAA GATCATCGAC
GATCAGGACG AGGCCCTCGA ACTCACCCGG GATGAGTACA TCTCACCGAT CCCGGCCGAT
ATGCAATGGC GGGCCTGGGC GGCGGACCCG GAAGGCATGA CGGGTGACGA GCTTCTGTCC
TTCGTCAACG AGTCCCTGTT TCCCCGCCTG AAGAATCTCC GCCCGACCGC GCCGCGCGCC
CGCGTCATCC GCGACGTGTT CGAGGACGCG TACAACTTCA TGAAGTCCGG CCAGCTGCTG
CGGCAAGTGA TCAACAAGAT CAACGGCGTC GACTTCAACA ACCTCACCGA GCGCCAGCAT
TTCGGCGACA TCTACGAACA GCTGCTGAAC GACCTGCAAA ACGCCGGCAA CGCGGGCGAA
TACTACACGC CCCGCGCCGT CACCGCCTTC ATGGTGCAGC AGATCGACCC GCGACCGGGC
GAGATCCTGA TGGACCCGGC CTGCGGCACC GGTGGCTTCC TCACCTGCGC CATGCGCCAC
ATGCGCGATC GGCACATCCG CCTGCCTGAG CATGAGGACC TGATGCAGCG CTCCCTGCGC
GCGGTCGAGA AGAAGCCACT GCCGCACATG CTCTGCGTCA CCAACATGCT GCTGAACGGG
GTCGAGGAGC CGCATTTCGT CCGCCACGAC AACACGCTCG CCCGCCCGCT GACCAGCTGG
ACCCGCGACG AGCGCGTCGA CATCGTGCTG ACCAACCCGC CCTTTGGCGG CAAGGAGGAG
GACGGGATCG AGAACAACTT CCCGACCTTC CGCACCCGCG AAACCGCCGA TCTGTTCCTC
GCCCTGATCA TCCGCCTGCT CAAGCCGGGC GGCCGCGCCG CCGTGGTGCT GCCCGATGGA
TCGCTGTTCG GTGAGGGCAT CAAGACCCGG CTGAAAGAGC ACCTGATGGC GGAATGCAAC
CTGCACACCA TCGTGCGGCT GCCGAACTCG GTCTTCAAGC CCTATGCCTC TATCGGCACC
AACCTGCTGT TCTTCGAAAA GGGCAGCCCC ACCACCGAGA CATGGTTCTG GGAGCATCGC
GTGCCCGAGG GGCAAAAGGC CTATTCGATG ACGAAGCCGA TCCGGCTTGA CCATCTGGCG
GGTTGTGCGG ACTGGTGGGG CGGCGCCAGC CGCGAGGGGC GCGTCGAAGG CCCGCAGGCC
TGGAAGGTCT CGGCCGAGGA GATCAAGGCC CGCGGTTACA ACCTCGACAT CAAGAACCCC
CATGCCGAGG CCGAGGATCT GGGCGACCCC GAGCATCTTC TGGCCGCGCT CGACGAGGCC
GAGGCCGAGG TCGCCCGTCT GCGCGCGGCG CTGAAAGCGA TCCTGACCGA GGCGCTGACG
CGATGA
 
Protein sequence
MSVRTLVKSV QDIMRKDTGV DGDAQRISQL CWMFFLKIID DQDEALELTR DEYISPIPAD 
MQWRAWAADP EGMTGDELLS FVNESLFPRL KNLRPTAPRA RVIRDVFEDA YNFMKSGQLL
RQVINKINGV DFNNLTERQH FGDIYEQLLN DLQNAGNAGE YYTPRAVTAF MVQQIDPRPG
EILMDPACGT GGFLTCAMRH MRDRHIRLPE HEDLMQRSLR AVEKKPLPHM LCVTNMLLNG
VEEPHFVRHD NTLARPLTSW TRDERVDIVL TNPPFGGKEE DGIENNFPTF RTRETADLFL
ALIIRLLKPG GRAAVVLPDG SLFGEGIKTR LKEHLMAECN LHTIVRLPNS VFKPYASIGT
NLLFFEKGSP TTETWFWEHR VPEGQKAYSM TKPIRLDHLA GCADWWGGAS REGRVEGPQA
WKVSAEEIKA RGYNLDIKNP HAEAEDLGDP EHLLAALDEA EAEVARLRAA LKAILTEALT
R