Gene Saro_1452 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1452 
Symbol 
ID3916117 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp1497036 
End bp1498343 
Gene Length1308 bp 
Protein Length435 aa 
Translation table11 
GC content67% 
IMG OID640444196 
ProductDNA methylase N-4/N-6 
Protein accessionYP_496730 
Protein GI87199473 
COG category[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG0863] DNA modification methylase
[COG1475] Predicted transcriptional regulators 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCGACA CCATCCCTTC GGCAGCACCG CTCGCCGATC GCCCGCTGAC CGTCGCCTAC 
AGGCCGACCG CCAGCCTTGT CCCCGATCCC CGCAACGCGC GCACCCATCC GCGACGACAG
ATCGAGCAAA TCGTTGCCTC GATCCGCGCC TTCGGGTTCA CCAACCCGGT GCTGGTCGAG
CCATCCGGCA AGATCATCGC CGGGCACGGC CGGCTCCTTG CCGCAAGGGA ACTCGGCCTT
GCCGAAGTGC CGGTCATCGA ACTTGCCGGA CTGGGCGAGG CACAGGTGCG CGCGCTGCGG
CTTGCCGACA ACCGTATCGC GCTCAATGCG GGGTGGGACA TCGAGATCCT CAAGCTCGAG
CTGGCCGACC TGTCGCTGCC CGAGATGGAG ATCGATCTTG CCCTCACCGG CTTTGCGAGT
GGCGAGATCG ACGTCATCCT CAGAGGTAGC ACGGATCCGG AAGATGACGT CATTCCGGCG
GTGCCGCGGA CGCCGCGCTC GCGCCCGGGC GACATCTGGC AGCTGGGCGC GCATCGCCTC
GGTTGCGGCG ATGGCAGGGA TGCAGCCTTC CTGCGGGCCG TTGTGGGTGA GGGCAAAGCG
ATCGACTGCG CGTTCCTCGA TCCGCCCTAC AACGTGAAGA TCAACGGCCA CGCCAATGCC
AGGGGACGGC ACCGCGAGTT CGCCATGGCC TCGGGCGAGA TGACCACGGC AGCGTTCCGC
ACGTTCCTTG CCGAGACGCT CGGAGCCAGT GCCGCGGTGT CGCGGCCCGG CGCGGTCCAC
TTCGTGTGCA TGGACTGGCG TCACATGGAC GATGTCAGCG CCGCGGCAAC GCCGGTCTAT
GACGATCTTC TCAACATCTG CGTGTGGAAC AAGAGCAACG CGGGAATGGG CTCCCTCTAC
CGTTCGAAGC ACGAGATGGT GTTCGTCTAC CGCGTGCCGG GGGCGCCGCA CACCAACGCG
GTGGAACTGG GACGTCATGG TCGCAACCGC ACCAACGTGT GGGACTATGC CTCGGTCAAT
TCGATGCGCG GCAGCCGCCG CGAGGACCTC GCGCTGCATC CCACGGTCAA GCCCGTGGCG
ATGGTTGCCG ATGCGATCTG CGATGTGACG CGGCAGGGGG ATCTCGTACT CGACATCTTC
TCGGGCTCCG GCACCACGCT CATTGCCGCC GAGCGGGTCG GCCGCGCCTT CCGCGGTATC
GACATCGATC CGGCCTATGT GGATGTCGCG CTTGATCGCT GGAGCGCGCT GACAGGGCGC
GAGCCGGTGC TGGTTGGCAG CGGCGCAGGC GGAGTGGACA GGGCATGA
 
Protein sequence
MADTIPSAAP LADRPLTVAY RPTASLVPDP RNARTHPRRQ IEQIVASIRA FGFTNPVLVE 
PSGKIIAGHG RLLAARELGL AEVPVIELAG LGEAQVRALR LADNRIALNA GWDIEILKLE
LADLSLPEME IDLALTGFAS GEIDVILRGS TDPEDDVIPA VPRTPRSRPG DIWQLGAHRL
GCGDGRDAAF LRAVVGEGKA IDCAFLDPPY NVKINGHANA RGRHREFAMA SGEMTTAAFR
TFLAETLGAS AAVSRPGAVH FVCMDWRHMD DVSAAATPVY DDLLNICVWN KSNAGMGSLY
RSKHEMVFVY RVPGAPHTNA VELGRHGRNR TNVWDYASVN SMRGSRREDL ALHPTVKPVA
MVADAICDVT RQGDLVLDIF SGSGTTLIAA ERVGRAFRGI DIDPAYVDVA LDRWSALTGR
EPVLVGSGAG GVDRA