Gene Saro_2173 details


Gene Information

Locus tag: Saro_2173
Symbol:
ID: 3918838
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 2313519
End bp: 2316458
Gene Length: 2940 bp
Protein Length: 979 aa
Translation table: 11
GC content: 59%
IMG OID: 640444928
Product: putative DNA methylase containing a Zn-ribbon
Protein accession: YP_497446
Protein GI: 87200189
COG category: [L] Replication, recombination and repair
COG ID: [COG1743] Adenine-specific DNA methylase containing a Zn-ribbon
TIGRFAM ID:
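
The coordinate and length fields above are consistent with each other, which can be checked with a short calculation. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(2313519, 2316458)
print(length)                  # 2940
print(protein_length(length))  # 979
```

Both values match the Gene Length and Protein Length fields of this record.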


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.145611
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGATGA CCGCCACCGA GAAATCCGCC GTTGTGCCGT TAAGCTTGCG GGACGCTCCG 
TCCCTGATCG AGCGGGCCTG GCCGACGGCG AAGATCTCGG CGGAGACCCA GAAGGAGCGG
AAGGCGGTTC AAGGGCAAAC ATTGACCGGG CTTGGTTCCT ACTGGAAGGG ACGCAAGCCC
CTGATTTTAA CCCGGGCCTG CGTCCTCGCG GCACTGATGC CGGCAACCGA TGATCTGGAA
CGGGACGTCG AGATCTTCGA GAAACTGATG GGGATGGCGG ACGAAACGTT CGGTCGTCGG
TATGACGGTG GGCCGACTGC CTTCGTGAAG ACGTTCCCGG ATCACGCGCT TGAGGTGGCC
GAGGATACAG GGCGTTCCTG GGTGTGGCGC GATGATCTTG ACCCGCGTGA ACGTCAGCGT
CGGATCGCGC AAGCCTTTAT CGACATCCCT TACGGACAGC GCCTGGAGAA GGTGAAACGT
CCTGAGGAGT TGGCGGAACA TGAGCTGCTA GACGGGATCT GGGGCGAGGT GAACGCTCAT
CTGCGTACGC GTGCCAGTTC GATTGCCGAA CTGGTTGAAC AGCTCGGGGT CATGCGGTTT
GGGCGACGCC CCCTTTTGGC CGACACCTTC TCCGGTTCGG GATCGATTCC CTTTGAGGCG
GCCCGGGTCG GCTGTGACGT CATCGCGTCA GACCTGAACC CGATCGCCTG CATGCTGAGC
TGGGCCAGCT TCAACATAGT GGGTGCCAGT CCTGAGCGGC ATGCGGAGAT CGTGCGCGAG
ATGCGTCAGA TCGCAAACAA CGTGGACAAG GCGATCACGG ACATTGGGAT CGAGCATGAC
GCCGACGGGA ACCGGGCGAA GCTCTACCTT TACTGCGTTG AGGTCCGGTG CCCCCGGACC
GGCTGGATGG TGCCTGTTAC CCCCACCTGG ATGATCTCCA AGAAGCGGAA CACCATCGCC
CGGCTCATCC CCGACCACGC GAACAAGCGC TATGAGATTG AGATACTTAA CGGGGTCACG
GATAAGGAAG TCGAGGCGGC GGAGGTCGGA ACGCTCCGGG GCGGTCGTCT CCATCACCCG
ATGCTAAAGG ATGACCTAGG CATTTCCATG AAGGAGATCC GGGGGGACTT CCGGGTTGAC
GACGGCACGA GCGGAAACCG GCTCCGGCTG TGGGAAAAGG ATGACGTCCG GCCCCGTCCG
AACGACATCT TCCAGGAACG CCTCTATGCG ATCCAGTGGA TGGACGGGCA GGACATCAAG
AAGGGGAAAG CTAACCCTAG AACCTGGTTC GCTTCCGTGA CGCCCGAAGA CCTCGCACGT
GAGGAGGAGG TCGTCGAATA TGTTGAGCGT CATCTGGCCG AATGGCAGCG GGATGGGTTC
GTTTCTGACA TGCAAATTGA GCCTGGGAAC AAGACAGATG AACCGATCCG GACACGCGGC
TGGACCTACT GGCACCATCT GTTCTGCCCA CGGCAAATTC TCTTCACAGC TCTAATACTG
TCTGAGGCAA AGATGTCGCC AGAAGGCCGC GCCTTCGCAG CAAAATTCAT TGATTGGAAC
TCTAAGCTCT GTCGCTACGG GACAGGCGCC GCGCGAGAGT CTGTGGCCCA AACCTTCTAT
AATCAGGCCC TCAACACGTT CCCAAACTAC GGCGTGAGGT CATTCGGTTT CGCCCGCTCG
TATCTAGAAG ACGTGCCCAA TAATTCCCCG CTATCTGGCC AGAGTAGGCT GATCAGTCGT
CGTGCAAGTG AGGCTGATGA GCACGTAGAC ATCTATCTCA CAGATCCTCC CTATGCGGAC
GCGGTCCAAT ATGACGAGAT CACCGAGTTT TTCATCGCAT GGCTGCGGAA GAATCCGCCT
GCACCGTTCG ATAAGTGGAT CTGGGATTCC CGCCGGAACC TCGCCATCAA GGGCGAAGGG
CAGTCCTTCA AGACAGCGAT GATCGACGCC TATGGCGCGA TGACCAAGCA TATGTCGGAC
AATGGGATCC AGATCGTGCA GTTCACGCAC CAGGATGCCA AGACCTGGTC CGACATGGCT
CAGATTTTCT GGGGCGCGGG TCTTCAGGTG GTTCAGGACT GGTACGTGTC GACTGAAAAT
ATGACTGAGC TAAAGAAAGG TGGCTACGTC CAAGGGACCC ACATGATCGT CCTAAGGAAA
CGCCAGGGCG AGCAGTCCGG CTACCAAGAT GAAATTGTCC ACGAGATCCG GGACGAGGTG
GAGCGCCAGA TCAAAGACAT GATCGGGCTG AACGACCAGA TGGACGCGGC GCGCGGCGAG
AATATCTTTA ACGACGCCGA CCTTCAGATG GCCGGCTACG CGGCCGCCCT CAGGGTTTTG
ACTGCTTACA CGAAGATCGA TGGCAAGGAT ATGACGCAGG AAGCGCTCCG CCCACGCAAG
AGGGGAGAAA AGACCCTGGT CGACGAGCTT ACTGAGTTCG CGGTCCAGAC TGCCAACGAA
TTCCTGGTCC CGGATGGGCT GGAACGGGAG CTATGGCTCG AGCTGAACGG AGCCGAGCGG
TTCTATCTTA AGATGATGGA CATCGAGGAG GCCGGTGAGG CCAAGCTCGA CCAATTCCAG
AACTTCGCCA AGGCGTTCCG GGTTGGGGAC TACGATGACC TGATGGTTTC GAAGACTCCG
AACAACTCGA AGCTCAAAAC CGCCAAGGGT CTCGGCCGCG GCGCCATGGC CGAGGGGGTT
CAGTTCGGCC GCGACAGCAC TGTCCGGCTC GCTTTGCGCG CAATTTGGCG CGTCGGCAAG
GAGGACGAGG TGGAGGACGT GCTAGACGAG CTACGCGACC TGATCCCGGA CTACCTACGC
AAGCGGGACG TGCTCCGCCA GATCGTGGCC TACGTTGCCA CGAAGCGGGA GCGGAATGAC
CCGGCAGAAG CGAGCGCAGC TCGGATCCTC GGGACCGCGA TCCAAACGGA GCGGATCTAA
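
The gene sequence is printed in blocks of 10 bases, so the spaces and line breaks have to be removed before any calculation. The sketch below shows one way to do that and to compute a GC percentage that can be compared with the GC content field of the record. It assumes the sequence has been pasted as plain text; the function names are hypothetical, and the rounding the database applies to the reported value is not known.

```python
def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a block-formatted DNA sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First block of the gene sequence: 4 of 10 bases are G or C.
print(gc_percent("ATGACGATGA"))  # 40.0
```

Passing the full gene sequence to `gc_percent` gives the value for the whole gene.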
 
Protein sequence
MTMTATEKSA VVPLSLRDAP SLIERAWPTA KISAETQKER KAVQGQTLTG LGSYWKGRKP 
LILTRACVLA ALMPATDDLE RDVEIFEKLM GMADETFGRR YDGGPTAFVK TFPDHALEVA
EDTGRSWVWR DDLDPRERQR RIAQAFIDIP YGQRLEKVKR PEELAEHELL DGIWGEVNAH
LRTRASSIAE LVEQLGVMRF GRRPLLADTF SGSGSIPFEA ARVGCDVIAS DLNPIACMLS
WASFNIVGAS PERHAEIVRE MRQIANNVDK AITDIGIEHD ADGNRAKLYL YCVEVRCPRT
GWMVPVTPTW MISKKRNTIA RLIPDHANKR YEIEILNGVT DKEVEAAEVG TLRGGRLHHP
MLKDDLGISM KEIRGDFRVD DGTSGNRLRL WEKDDVRPRP NDIFQERLYA IQWMDGQDIK
KGKANPRTWF ASVTPEDLAR EEEVVEYVER HLAEWQRDGF VSDMQIEPGN KTDEPIRTRG
WTYWHHLFCP RQILFTALIL SEAKMSPEGR AFAAKFIDWN SKLCRYGTGA ARESVAQTFY
NQALNTFPNY GVRSFGFARS YLEDVPNNSP LSGQSRLISR RASEADEHVD IYLTDPPYAD
AVQYDEITEF FIAWLRKNPP APFDKWIWDS RRNLAIKGEG QSFKTAMIDA YGAMTKHMSD
NGIQIVQFTH QDAKTWSDMA QIFWGAGLQV VQDWYVSTEN MTELKKGGYV QGTHMIVLRK
RQGEQSGYQD EIVHEIRDEV ERQIKDMIGL NDQMDAARGE NIFNDADLQM AGYAAALRVL
TAYTKIDGKD MTQEALRPRK RGEKTLVDEL TEFAVQTANE FLVPDGLERE LWLELNGAER
FYLKMMDIEE AGEAKLDQFQ NFAKAFRVGD YDDLMVSKTP NNSKLKTAKG LGRGAMAEGV
QFGRDSTVRL ALRAIWRVGK EDEVEDVLDE LRDLIPDYLR KRDVLRQIVA YVATKRERND
PAEASAARIL GTAIQTERI
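
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is a minimal illustration, not the database's own pipeline. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code; table 11 differs in the set of permitted start codons, which this sketch does not model (this gene starts with ATG, so that does not matter here). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the standard amino acid string.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a block-formatted DNA sequence up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGACGATGA CCGCCACCGA GAAATCCGCC"))  # MTMTATEKSA
```

The output matches the first 10 residues of the protein sequence above.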