Gene RPD_1972 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_1972 
Symbol 
ID4022454 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp2210219 
End bp2211691 
Gene Length1473 bp 
Protein Length490 aa 
Translation table11 
GC content56% 
IMG OID637962165 
ProductDNA-cytosine methyltransferase 
Protein accessionYP_569108 
Protein GI91976449 
COG category[L] Replication, recombination and repair 
COG ID[COG0270] Site-specific DNA methylase 
TIGRFAM ID[TIGR00675] DNA-methyltransferase (dcm) 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0825702 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.570166 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATCACG CTCTGTTGCA GCGTGTCGAG ATCTCGGAAA CGGATGGGAT GCGAGGGGCA 
GACGGTTTAT CGGTGGAGCT GCGGGTCGCT TCGTTGTTTG CGGGAATCGG AGGGTTCGAC
AAAGCTTTTG AATCGGTCTC AGCCTCCGTG GTCGCCCAGT GCGAGATTGA CTCCTTCTGC
CGCGCCGTAC TCAGGCGACA TTGGCCACAG ACTAAGCTTT TCGAAGATAT TACCAAGATC
AACCCAGCCG AATTTCCAGC GGCCGACATT TGGACCGCAG GCTTTCCCTG CCAAGATGTT
TCGCTCGCGC GCGGTAACCA CGGTCGAGAT GGGCTCAAAG GCAATCACAC GAGTCTTTTC
TTTAAGTTGA TGGATCTGGC CGAGGCTAAG AAGCCTAAGA TCATCCTGCT TGAAAACGTC
GTTGGCCTAC TCAATTCACA TCAGGGTTGT GATTTTGCAA TCATCTTGCG TGAGCTAACT
AATCAAGGAT ACGCCGTTTC TTGGCGTGTT CTGAATGCCC GTTACTTCGG CTCACCGCAA
TCGAGATCGC GCGTCTTTAT GGTGGCTTGG CGCGGCGACT ACAGGTTGGC GCTTGCGTCC
CTTTTCGAGC CGGTACGTGG TGCGAAGACG GCGGCCGAAC GCAAGGGATT TGTGACAAAG
ACGACGCATG CAAAGACCGG AGCGATTGTG CCTCAGGTCG CATATTGTGT CGCTGCAACG
TCGGGTAGGC ACACCGGAAA CGATTGGGCG CGTTCCTATA TTTCCTACAA GGATCGTGTC
CGCAGGCCGA CTGTGAGCGA AAGCGAGCGT TTGCAAGGTT TCGAAGCCGG GTGGACTGTG
CCTGGCGCTG GCTACCGCGA ACCCGCGCGC GGTTTCGATT CCGAGCGCTA CCGCGCGGTC
GGTAATGCAG TCGCGGTGCC TGTGGTCAGG TGGATCGCTC AGCGAATGAC AGCGGCGCTA
GCGCAGCCGA AAGCCCCATC AAGCCGCCGT GGTTTTATGG AGGAGTGCTT GCTCATCGCG
CCCGATCTTG CGAACTCGAC GGAGACACTT CGCTTTTCAG ACATTATGGA GGAGGTCAAC
AAGGGGGAAT TCGTCTACCG CTGGAAGGGC TGCGGCGTCG CCTGGGGTAA CAATATTGTC
GAAGGAGCTA CCGCTCCCGC TCCGTCGCAA GTCGTAGACT CGCGCTTTGT CAATTTGCTT
GACAATGAAG TGCCTGACGA CCGCTATTTT CTCACTCCTA ACGCTGCCAT CGGCATTTTG
AGGAGGGCGG ATTCGGTTGG CCGAACGCTG TTCGGACCGA TGCGTGAAGC ATTGGAAAAT
ATGGTAAAAT GCTTCTCTGC TGCGGATTCG CCGCGGGTTT TAGCCGGAGA GCAGATTGCC
AAGGTCAGCA TTCGTCCGCC TCGCACAAAC AAGCGCGGCA ATTCTCAACT CGATCGCTCA
ATTGCCGTCA GGGCCACACG CATCTCCTAT TAA
 
Protein sequence
MNHALLQRVE ISETDGMRGA DGLSVELRVA SLFAGIGGFD KAFESVSASV VAQCEIDSFC 
RAVLRRHWPQ TKLFEDITKI NPAEFPAADI WTAGFPCQDV SLARGNHGRD GLKGNHTSLF
FKLMDLAEAK KPKIILLENV VGLLNSHQGC DFAIILRELT NQGYAVSWRV LNARYFGSPQ
SRSRVFMVAW RGDYRLALAS LFEPVRGAKT AAERKGFVTK TTHAKTGAIV PQVAYCVAAT
SGRHTGNDWA RSYISYKDRV RRPTVSESER LQGFEAGWTV PGAGYREPAR GFDSERYRAV
GNAVAVPVVR WIAQRMTAAL AQPKAPSSRR GFMEECLLIA PDLANSTETL RFSDIMEEVN
KGEFVYRWKG CGVAWGNNIV EGATAPAPSQ VVDSRFVNLL DNEVPDDRYF LTPNAAIGIL
RRADSVGRTL FGPMREALEN MVKCFSAADS PRVLAGEQIA KVSIRPPRTN KRGNSQLDRS
IAVRATRISY