Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_0261 |
Symbol | |
ID | 3917613 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 276073 |
End bp | 278802 |
Gene Length | 2730 bp |
Protein Length | 909 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640442989 |
Product | type II restriction enzyme, methylase subunit |
Protein accession | YP_495543 |
Protein GI | 87198286 |
COG category | [V] Defense mechanisms |
COG ID | [COG1002] Type II restriction enzyme, methylase subunits |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.504808 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACCCCG TAGAGATTGA AGAAGCCGTC TCGGACCTCG CTCGCGCACC CTATGACGCT TCCGAGTTTC CGTTCCAGTT CCTTGCCGCT TTCGGCAACA AGCAGACCAC GCTGCAGCGA TTGCGGGCGG GCAATTCCAA TCAGTCCGAT CTTCCGGGCG CCGTCTTGCA GCGTAACCAT ATCCACATAG CAACTTGCGA TGCGGGCAAC GTCGACAGAA CGCTGGCCGC TCTTCGCAAG AGTCCCAAGA CGGCGTCACA AAAGGCTCGG TTCATCCTCG CCACTGATGG CGTGGCCTTC CAGGCGGAGG ACATGGCCAG TGGCGAGACA GTAGCCTGCA ATTACGCCGC CTTCCCGGAC AAGTTTGCCT TCTTCCTGCC ACTGGCGGGC ATAACCACGG TCCAGCAGAT CCGTGAAAGC AGCTTTGACA TCAAGGCGAC CGGCCGCCTT AACAAGCTCT ATGTCGAACT GCTGAAGGAC AACCCTGACT GGGCCAGCAG AAGCGAGGAC ATGAACCACT TCATGGCCCG CCTGATCTTC TGCTTCTTTG CCGAGGATAC CGACATTTTC GTCGGCGAAG GCCTGTTCAG CCGCACCGTC GAGACCATGA GTGCGCGTGA TGCGTCCGAC ACGCACATGG TCATCGCCGA AATTTTCCGC GCGATGGACA CCAGGCTGGC CGATCGCGCT GCCGCCGGGA TCAAGAGCTG GGCCGATGTC TTCCCCTATG TGAACGGGCA GCTCTTCTCG GGATCCACCG AATGCCCGCG CTTCAGCAAG ATCGCGCGGT CATACCTGCT GCATATCGGC AGCCTCGATT GGCAGAAGAT CAACCCGGAC ATCTTCGGCT CGATGATCCA GGCCGTGGCC GACGATGAGG AGCGCGGTGC GCTCGGCATG CATTACACCA GCGTGCCGAA CATCCTGAAG GTTCTGAACC CGCTGTTCCT CGATGACCTG CGGGCGAAGC TTGAGGAAGC GGGTGACAAC AGCCGCAAGC TGCTCAACCT GCGCAACCGC ATGGCCAAGA TCCGGGTGTT CGATCCGGCC TGCGGATCGG GCAACTTCCT CGTCATCGCT TACAAGCAGA TGCGCGAGCT TGAGGCGGAG ATCAACCGGC GGCGCGGCGA GGCCGACCGG CGCAGCGACA TTCCGCTGAC CAACTTCCGC GGTATCGAAC TACGCAACTT TCCCGCCGAG ATCGCCCGGC TGGCCCTGAT TATCGCGGAG TACCAGTGCG ACGTGCTCTA TCGCGGGCAG AAGGAGGCGC TAGCCGAGTT CTTGCCGCTT GATAGCCAGA ACTGGATTAC CTGCGGCAAT GCCCTGCGGT TGGATTGGCT GAGCATCTGC CCACCAACCG GAACTGCCGT AAAACTGCAG GCGAACGACC TGTTTGAGAT GCCGCTCGAT CAGGCGGAGA TCGACTTCGA GAATGAAGGT GGGGAGACGT ATATCTGCGG TAATCCCCCT TATCTCGGGG CGAAGAAGAA GAGCTCGGAT CAAATAGAGG ATATGAAGCG AGTCGGGTTG GATAAAGCCC AACTTCTGGA CTACGTATCC GCTTTTATTG TTCGGGGATT GCCATTAGTT GCGCAACAAA GATGCGACAT GGCTCTGGTT TCGACCAGTT CAATATGTCA GGGAGAGCAA GTCTCGCTCA TATGGCCGCG CATATTAAAA TCCGCGAACG TCAAGTTCGC TTACCGACCA TTCCGATGGA GTAATTCCGC GGCGAACAAC GCTGGTGTCT ATTGCACGAT AATTGGCCTT ACTGGATCTG AGGTATCGAA TAAAAAGCTC TTTGGAGAAG GAAGTGTCGT AGAATGTTCG TCGATCGCGC CCTACCTCGT GCCGGGACCA GAGATCATTT GCGCTCCAAG GCAATCGTCG ATCTCAGGCT TCGCCCGTAT GGTAATGGGG AGCAACCCTG TAGATGGAAA GCGCTTGATT TTCGAGCAAG ATGAAAAAGA AAGCGTTGTT GCAGCCGACC CTCGGTCAGA ACGCTTCTTT AAGCGTTATG GGGGTACTCA AGAATTAGTT AATGGCGTGG ATCGATGGTG TTTGTGGATT AACGATGATC AAGTTGATGA CGCAAAAGCC ATTGCAGAAA TAGCGAAGGT GCTTGAAAGC TGTCGTTCAT ATAGGCAAGG CGCTGGCCGC GATGCTCAAA AAGCAGCAAA TCGTCCCCAC TCGTTTTGCT ACAGAACGTT TCAGGAAAAT ATTGGTATCC ATGTTGGCCT AACGATTGGT AACGGCCTCA GCCATGTTCC CGCTGATCTT AAGAGTAGCG GCTTTGTTTC TAGCCATACT GCATACATGA TTTATGGTTG GCATCCGGTT GAGTTCGCGT TGTTGAACTC GCGGCTGATG TTGGTTTGGA CTGAAACGGT TGGTGGCAGA CTGGGTAATG GAATGCGCTT CAGCAACACG ATCGTTTATA ATACATTCCC GGTCCCTTCC CTCACTGACC AGAACAAGGC CGACCTCACC CGCTGCGCGG AGGACATCCT CCTCGCCCGA GAGTCGCATT TCCCGGCTAC GATTGCGGAC CTCTATGATC CCGAGACCAT GCCCGAAAGC CTGCGCGCCG CGCACGATCG CAACGACGAA GTCCTCGAAC GCATCTACAT CGGCCGCCGC TTCCGCAACG ACACCGAACG CCTCGAAAAG CTATTCGAAC TCTACACCAA AATGACTGGC GGACGATCCT CAGAAGGTGG AGCGGCATGA
|
Protein sequence | MNPVEIEEAV SDLARAPYDA SEFPFQFLAA FGNKQTTLQR LRAGNSNQSD LPGAVLQRNH IHIATCDAGN VDRTLAALRK SPKTASQKAR FILATDGVAF QAEDMASGET VACNYAAFPD KFAFFLPLAG ITTVQQIRES SFDIKATGRL NKLYVELLKD NPDWASRSED MNHFMARLIF CFFAEDTDIF VGEGLFSRTV ETMSARDASD THMVIAEIFR AMDTRLADRA AAGIKSWADV FPYVNGQLFS GSTECPRFSK IARSYLLHIG SLDWQKINPD IFGSMIQAVA DDEERGALGM HYTSVPNILK VLNPLFLDDL RAKLEEAGDN SRKLLNLRNR MAKIRVFDPA CGSGNFLVIA YKQMRELEAE INRRRGEADR RSDIPLTNFR GIELRNFPAE IARLALIIAE YQCDVLYRGQ KEALAEFLPL DSQNWITCGN ALRLDWLSIC PPTGTAVKLQ ANDLFEMPLD QAEIDFENEG GETYICGNPP YLGAKKKSSD QIEDMKRVGL DKAQLLDYVS AFIVRGLPLV AQQRCDMALV STSSICQGEQ VSLIWPRILK SANVKFAYRP FRWSNSAANN AGVYCTIIGL TGSEVSNKKL FGEGSVVECS SIAPYLVPGP EIICAPRQSS ISGFARMVMG SNPVDGKRLI FEQDEKESVV AADPRSERFF KRYGGTQELV NGVDRWCLWI NDDQVDDAKA IAEIAKVLES CRSYRQGAGR DAQKAANRPH SFCYRTFQEN IGIHVGLTIG NGLSHVPADL KSSGFVSSHT AYMIYGWHPV EFALLNSRLM LVWTETVGGR LGNGMRFSNT IVYNTFPVPS LTDQNKADLT RCAEDILLAR ESHFPATIAD LYDPETMPES LRAAHDRNDE VLERIYIGRR FRNDTERLEK LFELYTKMTG GRSSEGGAA
|
| |