Gene Saro_1840 details


Gene Information

Locus tag: Saro_1840
Symbol:
ID: 3918400
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 1939534
End bp: 1940676
Gene Length: 1143 bp
Protein Length: 380 aa
Translation table: 11
GC content: 63%
IMG OID: 640444582
Product: DNA methylase N-4/N-6
Protein accession: YP_497114
Protein GI: 87199857
COG category: [L] Replication, recombination and repair
COG ID: [COG0863] DNA modification methylase
TIGRFAM ID:
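
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the last codon is a stop codon that is not counted in the protein length. Neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1939534
end_bp = 1940676

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted
# in the protein length.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the result agrees with the listed gene length of 1143 bp and protein length of 380 aa.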


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.0373756
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAGTTG CAACCAAGGA ACGCGCTAAG GCTCTTCGCG CGGCACCGGC AAAGGTGCTG 
AAGGCAGACA TCGCGCTGCC CGTGAACCAG ATCCTGCGCG GCGATTGCAT TGCCGAGATG
CGCAAGCTGC CCGACGCCTC CATCGACATG ATCTTCGCCG ATCCGCCCTA CAACCTCCAG
CTTGGCGGCG ATCTGGCTCG TCCCGATGGC AGCCATGTGG ACGCCGTCAC CAACGATTGG
GACAAGTTCT CGAGCTTTGC CGCCTACGAC AAGTTCACGC GCGAATGGCT GGTCGAGGCG
CGCCGCCTGC TGAAGCCGGA TGGTTCGATC TGGGTGATCG GCAGCTACCA CAACATCTTC
CGCGTGGGTG CGCTGCTGCA GGATCTGGGG TTCTGGATTC TCAACGACAT CATCTGGCGC
AAGGCCAACC CGATGCCCAA TTTCAAGGGC ACCCGCTTCA CCAACGCGCA CGAAACGCTG
ATCTGGGCGT CGAAGAGCGA GAAGTCGAAG TACACCTTCA ACTATCGCGC GATGAAGACC
CTGAACGACG AATTGCAGAT GCGCTCCGAC TGGGTTCTGC CGATCTGTTC GGGGCCGGAG
CGCCTGCGCC GCAACGGCAC CAAGGCGCAC CCGACGCAGA AGCCAGAGGC GCTGCTCTAT
CGCGTGATGC TTGCGACGAC CAACAAGGGC GACGTCGTGC TGGACCCGTT TTTCGGCACT
GGCACCACCG GCGCGGTGGC CAAGCGGCTT GGCCGCAACT GGATCGGCTG CGAACGCGAG
GATGACTACA TCGAGGTCGC CAACGAGCGC ATCGAACTGG CGCTGCCGCT TGACGAAAGC
GCGCTGACGA CGATGCAGTC GAAGCGTAGC GCGCCCAAGG TGGCGTTCGG CGCACTGGTC
GAAAGCGGTT ATCTGGCTCC CGGCACGCGG CTCACGGCCA AGAAGGGGCG GTTCAATGCG
GTCGTTCGCG CCGACGGGTC GCTTCAGTCC GAAGCCGAGA TCGGTTCGAT CCACGGGCTC
GGGGCAAAGC TCCAGGGTGC GCCTTCGTGC AATGGCTGGA CGTTCTGGCA TGTCGAGCAC
GAAGGCGAGG TGAAGCCGAT CGACGCTCTG CGCCAGCTCT ACCTGCTCGC CGTGGAAGAT
TGA
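
The record reports a GC content of 63% for this gene. One way to recompute such a figure from the sequence as laid out above (blocks of ten bases separated by spaces) is sketched below. The function name `gc_percent` is hypothetical, and the sketch assumes GC content means the percentage of G and C bases over all bases in the sequence.

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace (spaces and line breaks between the 10-base blocks)
    is removed before counting.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Usage: paste the full gene sequence between the quotes to recompute
# the record's figure; only the first line is shown here.
first_line = "ATGGCAGTTG CAACCAAGGA ACGCGCTAAG GCTCTTCGCG CGGCACCGGC AAAGGTGCTG"
print(round(gc_percent(first_line), 1))
```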
 
Protein sequence
MAVATKERAK ALRAAPAKVL KADIALPVNQ ILRGDCIAEM RKLPDASIDM IFADPPYNLQ 
LGGDLARPDG SHVDAVTNDW DKFSSFAAYD KFTREWLVEA RRLLKPDGSI WVIGSYHNIF
RVGALLQDLG FWILNDIIWR KANPMPNFKG TRFTNAHETL IWASKSEKSK YTFNYRAMKT
LNDELQMRSD WVLPICSGPE RLRRNGTKAH PTQKPEALLY RVMLATTNKG DVVLDPFFGT
GTTGAVAKRL GRNWIGCERE DDYIEVANER IELALPLDES ALTTMQSKRS APKVAFGALV
ESGYLAPGTR LTAKKGRFNA VVRADGSLQS EAEIGSIHGL GAKLQGAPSC NGWTFWHVEH
EGEVKPIDAL RQLYLLAVED
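
The protein sequence can be compared against a translation of the gene sequence. The sketch below is an illustration, not the tool that produced this record. It uses the standard codon assignments, which translation table 11 shares with the standard code (table 11 differs in the start codons it permits, which does not matter here because the gene begins with ATG). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; third position
# varies fastest). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a DNA sequence from its first base up to the first stop codon."""
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "ATGGCAGTTG CAACCAAGGA ACGCGCTAAG GCTCTTCGCG CGGCACCGGC AAAGGTGCTG"
print(translate(first_line))  # MAVATKERAKALRAAPAKVL
```

The printed residues match the first twenty residues of the protein sequence above (MAVATKERAK ALRAAPAKVL).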