Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_2173 |
Symbol | |
ID | 3918838 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | - |
Start bp | 2313519 |
End bp | 2316458 |
Gene Length | 2940 bp |
Protein Length | 979 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640444928 |
Product | putative DNA methylase containing a Zn-ribbon |
Protein accession | YP_497446 |
Protein GI | 87200189 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1743] Adenine-specific DNA methylase containing a Zn-ribbon |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.145611 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGATGA CCGCCACCGA GAAATCCGCC GTTGTGCCGT TAAGCTTGCG GGACGCTCCG TCCCTGATCG AGCGGGCCTG GCCGACGGCG AAGATCTCGG CGGAGACCCA GAAGGAGCGG AAGGCGGTTC AAGGGCAAAC ATTGACCGGG CTTGGTTCCT ACTGGAAGGG ACGCAAGCCC CTGATTTTAA CCCGGGCCTG CGTCCTCGCG GCACTGATGC CGGCAACCGA TGATCTGGAA CGGGACGTCG AGATCTTCGA GAAACTGATG GGGATGGCGG ACGAAACGTT CGGTCGTCGG TATGACGGTG GGCCGACTGC CTTCGTGAAG ACGTTCCCGG ATCACGCGCT TGAGGTGGCC GAGGATACAG GGCGTTCCTG GGTGTGGCGC GATGATCTTG ACCCGCGTGA ACGTCAGCGT CGGATCGCGC AAGCCTTTAT CGACATCCCT TACGGACAGC GCCTGGAGAA GGTGAAACGT CCTGAGGAGT TGGCGGAACA TGAGCTGCTA GACGGGATCT GGGGCGAGGT GAACGCTCAT CTGCGTACGC GTGCCAGTTC GATTGCCGAA CTGGTTGAAC AGCTCGGGGT CATGCGGTTT GGGCGACGCC CCCTTTTGGC CGACACCTTC TCCGGTTCGG GATCGATTCC CTTTGAGGCG GCCCGGGTCG GCTGTGACGT CATCGCGTCA GACCTGAACC CGATCGCCTG CATGCTGAGC TGGGCCAGCT TCAACATAGT GGGTGCCAGT CCTGAGCGGC ATGCGGAGAT CGTGCGCGAG ATGCGTCAGA TCGCAAACAA CGTGGACAAG GCGATCACGG ACATTGGGAT CGAGCATGAC GCCGACGGGA ACCGGGCGAA GCTCTACCTT TACTGCGTTG AGGTCCGGTG CCCCCGGACC GGCTGGATGG TGCCTGTTAC CCCCACCTGG ATGATCTCCA AGAAGCGGAA CACCATCGCC CGGCTCATCC CCGACCACGC GAACAAGCGC TATGAGATTG AGATACTTAA CGGGGTCACG GATAAGGAAG TCGAGGCGGC GGAGGTCGGA ACGCTCCGGG GCGGTCGTCT CCATCACCCG ATGCTAAAGG ATGACCTAGG CATTTCCATG AAGGAGATCC GGGGGGACTT CCGGGTTGAC GACGGCACGA GCGGAAACCG GCTCCGGCTG TGGGAAAAGG ATGACGTCCG GCCCCGTCCG AACGACATCT TCCAGGAACG CCTCTATGCG ATCCAGTGGA TGGACGGGCA GGACATCAAG AAGGGGAAAG CTAACCCTAG AACCTGGTTC GCTTCCGTGA CGCCCGAAGA CCTCGCACGT GAGGAGGAGG TCGTCGAATA TGTTGAGCGT CATCTGGCCG AATGGCAGCG GGATGGGTTC GTTTCTGACA TGCAAATTGA GCCTGGGAAC AAGACAGATG AACCGATCCG GACACGCGGC TGGACCTACT GGCACCATCT GTTCTGCCCA CGGCAAATTC TCTTCACAGC TCTAATACTG TCTGAGGCAA AGATGTCGCC AGAAGGCCGC GCCTTCGCAG CAAAATTCAT TGATTGGAAC TCTAAGCTCT GTCGCTACGG GACAGGCGCC GCGCGAGAGT CTGTGGCCCA AACCTTCTAT AATCAGGCCC TCAACACGTT CCCAAACTAC GGCGTGAGGT CATTCGGTTT CGCCCGCTCG TATCTAGAAG ACGTGCCCAA TAATTCCCCG CTATCTGGCC AGAGTAGGCT GATCAGTCGT CGTGCAAGTG AGGCTGATGA GCACGTAGAC ATCTATCTCA CAGATCCTCC CTATGCGGAC GCGGTCCAAT ATGACGAGAT CACCGAGTTT TTCATCGCAT GGCTGCGGAA GAATCCGCCT GCACCGTTCG ATAAGTGGAT CTGGGATTCC CGCCGGAACC TCGCCATCAA GGGCGAAGGG CAGTCCTTCA AGACAGCGAT GATCGACGCC TATGGCGCGA TGACCAAGCA TATGTCGGAC AATGGGATCC AGATCGTGCA GTTCACGCAC CAGGATGCCA AGACCTGGTC CGACATGGCT CAGATTTTCT GGGGCGCGGG TCTTCAGGTG GTTCAGGACT GGTACGTGTC GACTGAAAAT ATGACTGAGC TAAAGAAAGG TGGCTACGTC CAAGGGACCC ACATGATCGT CCTAAGGAAA CGCCAGGGCG AGCAGTCCGG CTACCAAGAT GAAATTGTCC ACGAGATCCG GGACGAGGTG GAGCGCCAGA TCAAAGACAT GATCGGGCTG AACGACCAGA TGGACGCGGC GCGCGGCGAG AATATCTTTA ACGACGCCGA CCTTCAGATG GCCGGCTACG CGGCCGCCCT CAGGGTTTTG ACTGCTTACA CGAAGATCGA TGGCAAGGAT ATGACGCAGG AAGCGCTCCG CCCACGCAAG AGGGGAGAAA AGACCCTGGT CGACGAGCTT ACTGAGTTCG CGGTCCAGAC TGCCAACGAA TTCCTGGTCC CGGATGGGCT GGAACGGGAG CTATGGCTCG AGCTGAACGG AGCCGAGCGG TTCTATCTTA AGATGATGGA CATCGAGGAG GCCGGTGAGG CCAAGCTCGA CCAATTCCAG AACTTCGCCA AGGCGTTCCG GGTTGGGGAC TACGATGACC TGATGGTTTC GAAGACTCCG AACAACTCGA AGCTCAAAAC CGCCAAGGGT CTCGGCCGCG GCGCCATGGC CGAGGGGGTT CAGTTCGGCC GCGACAGCAC TGTCCGGCTC GCTTTGCGCG CAATTTGGCG CGTCGGCAAG GAGGACGAGG TGGAGGACGT GCTAGACGAG CTACGCGACC TGATCCCGGA CTACCTACGC AAGCGGGACG TGCTCCGCCA GATCGTGGCC TACGTTGCCA CGAAGCGGGA GCGGAATGAC CCGGCAGAAG CGAGCGCAGC TCGGATCCTC GGGACCGCGA TCCAAACGGA GCGGATCTAA
|
Protein sequence | MTMTATEKSA VVPLSLRDAP SLIERAWPTA KISAETQKER KAVQGQTLTG LGSYWKGRKP LILTRACVLA ALMPATDDLE RDVEIFEKLM GMADETFGRR YDGGPTAFVK TFPDHALEVA EDTGRSWVWR DDLDPRERQR RIAQAFIDIP YGQRLEKVKR PEELAEHELL DGIWGEVNAH LRTRASSIAE LVEQLGVMRF GRRPLLADTF SGSGSIPFEA ARVGCDVIAS DLNPIACMLS WASFNIVGAS PERHAEIVRE MRQIANNVDK AITDIGIEHD ADGNRAKLYL YCVEVRCPRT GWMVPVTPTW MISKKRNTIA RLIPDHANKR YEIEILNGVT DKEVEAAEVG TLRGGRLHHP MLKDDLGISM KEIRGDFRVD DGTSGNRLRL WEKDDVRPRP NDIFQERLYA IQWMDGQDIK KGKANPRTWF ASVTPEDLAR EEEVVEYVER HLAEWQRDGF VSDMQIEPGN KTDEPIRTRG WTYWHHLFCP RQILFTALIL SEAKMSPEGR AFAAKFIDWN SKLCRYGTGA ARESVAQTFY NQALNTFPNY GVRSFGFARS YLEDVPNNSP LSGQSRLISR RASEADEHVD IYLTDPPYAD AVQYDEITEF FIAWLRKNPP APFDKWIWDS RRNLAIKGEG QSFKTAMIDA YGAMTKHMSD NGIQIVQFTH QDAKTWSDMA QIFWGAGLQV VQDWYVSTEN MTELKKGGYV QGTHMIVLRK RQGEQSGYQD EIVHEIRDEV ERQIKDMIGL NDQMDAARGE NIFNDADLQM AGYAAALRVL TAYTKIDGKD MTQEALRPRK RGEKTLVDEL TEFAVQTANE FLVPDGLERE LWLELNGAER FYLKMMDIEE AGEAKLDQFQ NFAKAFRVGD YDDLMVSKTP NNSKLKTAKG LGRGAMAEGV QFGRDSTVRL ALRAIWRVGK EDEVEDVLDE LRDLIPDYLR KRDVLRQIVA YVATKRERND PAEASAARIL GTAIQTERI
|
| |