Gene Rsph17029_0119 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_0119 
Symbol 
ID4897384 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp134469 
End bp135587 
Gene Length1119 bp 
Protein Length372 aa 
Translation table11 
GC content66% 
IMG OID640110702 
ProductDNA methylase N-4/N-6 domain-containing protein 
Protein accessionYP_001042011 
Protein GI126460897 
COG category[L] Replication, recombination and repair 
COG ID[COG0863] DNA modification methylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.222875 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.070794 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGACCA AGACCAAGAC GACGGAGGCC CCCGTGCTTC CGCTGAACCA GATCCTTGCG 
GGCGATTGCA TCGAGACGAT GCGGTCCCTG CCCGAATGTT CGGTCGACCT GATCTTCGCC
GATCCGCCCT ACAACCTGCA GCTCCGCGGC GACCTGCACC GGCCCGACAA CAGCCGGGTG
GATGCGGTGG ACGACCACTG GGACCAGTTC TCGAGCTTCT CGGTCTATGA CCAGTTCACC
CGCGAATGGC TCGCGGCCGC CCGGCGGCTG CTGAAGCCCA ACGGCGCGAT CTGGGTCATC
GGCAGCTATC ACAACATCTT CCGCGTGGGG GCAGCCCTTC AGGACCAGGG CTTCTGGATC
CTGAACGATG TGGTCTGGCG CAAGTCGAAC CCGATGCCGA ACTTCAAGGG CAAGCGGCTG
ACCAACGCGC ACGAGACGCT GATCTGGGCC TCGAAGCAGG AAGCCAGCAA ATATACCTTC
AATTACGAGG CACTGAAGGC CCTGAACGAG GGCGTGCAGA TGCGCTCGGA CTGGGTGATC
CCGATCTGCA CCGGCCATGA GCGGCTGAAG GACGAGCAGG GCGACAAGGC CCACCCGACC
CAGAAGCCCG AGGCGCTGCT GCACCGGGTG ATGGTCGCCA CGACCAATCC GGGCGACGTG
GTGCTCGACC CGTTCTTCGG CACCGGCACG ACCGGCGCGG TGGCCAAGAT GCTCGGCCGC
GACTTCATCG GCATCGAGCG CGAAGAGAGC TACCGCAGGA TCGCGGCCGA GCGGCTGTCG
CGCGTGCGCC GCTACGACGC CTCGGCGCTC GAGGTCTCGG GCTCGAAGCG GGCCGAGCCG
CGGGTGCCCT TCGGCCAGCT GGTCGAGCGC GGGATGCTGC GCCCGGGCGA AGAGCTCTAT
TCGATGAACA ACCGCCACAA GGCGAAGGTG CGCGCCGACG GCACGCTGAT CGGCAACGAT
GTGAAGGGCT CGATCCACCA GGTCGGCGCC GCGCTGGAAG GCGCGCCCTC CTGCAACGGC
TGGACCTACT GGTGCTACAA GCGCGAGGGG AAGATGATCC CCATCGACAT CCTGCGCCAG
CAGATCCGGG CGGAGATGGA AGACCCGCGC CCCAACTGA
 
Protein sequence
MATKTKTTEA PVLPLNQILA GDCIETMRSL PECSVDLIFA DPPYNLQLRG DLHRPDNSRV 
DAVDDHWDQF SSFSVYDQFT REWLAAARRL LKPNGAIWVI GSYHNIFRVG AALQDQGFWI
LNDVVWRKSN PMPNFKGKRL TNAHETLIWA SKQEASKYTF NYEALKALNE GVQMRSDWVI
PICTGHERLK DEQGDKAHPT QKPEALLHRV MVATTNPGDV VLDPFFGTGT TGAVAKMLGR
DFIGIEREES YRRIAAERLS RVRRYDASAL EVSGSKRAEP RVPFGQLVER GMLRPGEELY
SMNNRHKAKV RADGTLIGND VKGSIHQVGA ALEGAPSCNG WTYWCYKREG KMIPIDILRQ
QIRAEMEDPR PN