Gene Saro_0261 details

Gene Information

Locus tag: Saro_0261
Symbol:
ID: 3917613
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 276073
End bp: 278802
Gene Length: 2730 bp
Protein Length: 909 aa
Translation table: 11
GC content: 56%
IMG OID: 640442989
Product: type II restriction enzyme, methylase subunit
Protein accession: YP_495543
Protein GI: 87198286
COG category: [V] Defense mechanisms
COG ID: [COG1002] Type II restriction enzyme, methylase subunits
TIGRFAM ID: [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type
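
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (which matches the listed gene length) and that the CDS is unspliced and ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp 276073, End bp 278802.
length = gene_length_bp(276073, 278802)
print(length, protein_length_aa(length))  # prints "2730 909"
```

Both results agree with the Gene Length and Protein Length fields in the record.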


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.504808
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACCCCG TAGAGATTGA AGAAGCCGTC TCGGACCTCG CTCGCGCACC CTATGACGCT 
TCCGAGTTTC CGTTCCAGTT CCTTGCCGCT TTCGGCAACA AGCAGACCAC GCTGCAGCGA
TTGCGGGCGG GCAATTCCAA TCAGTCCGAT CTTCCGGGCG CCGTCTTGCA GCGTAACCAT
ATCCACATAG CAACTTGCGA TGCGGGCAAC GTCGACAGAA CGCTGGCCGC TCTTCGCAAG
AGTCCCAAGA CGGCGTCACA AAAGGCTCGG TTCATCCTCG CCACTGATGG CGTGGCCTTC
CAGGCGGAGG ACATGGCCAG TGGCGAGACA GTAGCCTGCA ATTACGCCGC CTTCCCGGAC
AAGTTTGCCT TCTTCCTGCC ACTGGCGGGC ATAACCACGG TCCAGCAGAT CCGTGAAAGC
AGCTTTGACA TCAAGGCGAC CGGCCGCCTT AACAAGCTCT ATGTCGAACT GCTGAAGGAC
AACCCTGACT GGGCCAGCAG AAGCGAGGAC ATGAACCACT TCATGGCCCG CCTGATCTTC
TGCTTCTTTG CCGAGGATAC CGACATTTTC GTCGGCGAAG GCCTGTTCAG CCGCACCGTC
GAGACCATGA GTGCGCGTGA TGCGTCCGAC ACGCACATGG TCATCGCCGA AATTTTCCGC
GCGATGGACA CCAGGCTGGC CGATCGCGCT GCCGCCGGGA TCAAGAGCTG GGCCGATGTC
TTCCCCTATG TGAACGGGCA GCTCTTCTCG GGATCCACCG AATGCCCGCG CTTCAGCAAG
ATCGCGCGGT CATACCTGCT GCATATCGGC AGCCTCGATT GGCAGAAGAT CAACCCGGAC
ATCTTCGGCT CGATGATCCA GGCCGTGGCC GACGATGAGG AGCGCGGTGC GCTCGGCATG
CATTACACCA GCGTGCCGAA CATCCTGAAG GTTCTGAACC CGCTGTTCCT CGATGACCTG
CGGGCGAAGC TTGAGGAAGC GGGTGACAAC AGCCGCAAGC TGCTCAACCT GCGCAACCGC
ATGGCCAAGA TCCGGGTGTT CGATCCGGCC TGCGGATCGG GCAACTTCCT CGTCATCGCT
TACAAGCAGA TGCGCGAGCT TGAGGCGGAG ATCAACCGGC GGCGCGGCGA GGCCGACCGG
CGCAGCGACA TTCCGCTGAC CAACTTCCGC GGTATCGAAC TACGCAACTT TCCCGCCGAG
ATCGCCCGGC TGGCCCTGAT TATCGCGGAG TACCAGTGCG ACGTGCTCTA TCGCGGGCAG
AAGGAGGCGC TAGCCGAGTT CTTGCCGCTT GATAGCCAGA ACTGGATTAC CTGCGGCAAT
GCCCTGCGGT TGGATTGGCT GAGCATCTGC CCACCAACCG GAACTGCCGT AAAACTGCAG
GCGAACGACC TGTTTGAGAT GCCGCTCGAT CAGGCGGAGA TCGACTTCGA GAATGAAGGT
GGGGAGACGT ATATCTGCGG TAATCCCCCT TATCTCGGGG CGAAGAAGAA GAGCTCGGAT
CAAATAGAGG ATATGAAGCG AGTCGGGTTG GATAAAGCCC AACTTCTGGA CTACGTATCC
GCTTTTATTG TTCGGGGATT GCCATTAGTT GCGCAACAAA GATGCGACAT GGCTCTGGTT
TCGACCAGTT CAATATGTCA GGGAGAGCAA GTCTCGCTCA TATGGCCGCG CATATTAAAA
TCCGCGAACG TCAAGTTCGC TTACCGACCA TTCCGATGGA GTAATTCCGC GGCGAACAAC
GCTGGTGTCT ATTGCACGAT AATTGGCCTT ACTGGATCTG AGGTATCGAA TAAAAAGCTC
TTTGGAGAAG GAAGTGTCGT AGAATGTTCG TCGATCGCGC CCTACCTCGT GCCGGGACCA
GAGATCATTT GCGCTCCAAG GCAATCGTCG ATCTCAGGCT TCGCCCGTAT GGTAATGGGG
AGCAACCCTG TAGATGGAAA GCGCTTGATT TTCGAGCAAG ATGAAAAAGA AAGCGTTGTT
GCAGCCGACC CTCGGTCAGA ACGCTTCTTT AAGCGTTATG GGGGTACTCA AGAATTAGTT
AATGGCGTGG ATCGATGGTG TTTGTGGATT AACGATGATC AAGTTGATGA CGCAAAAGCC
ATTGCAGAAA TAGCGAAGGT GCTTGAAAGC TGTCGTTCAT ATAGGCAAGG CGCTGGCCGC
GATGCTCAAA AAGCAGCAAA TCGTCCCCAC TCGTTTTGCT ACAGAACGTT TCAGGAAAAT
ATTGGTATCC ATGTTGGCCT AACGATTGGT AACGGCCTCA GCCATGTTCC CGCTGATCTT
AAGAGTAGCG GCTTTGTTTC TAGCCATACT GCATACATGA TTTATGGTTG GCATCCGGTT
GAGTTCGCGT TGTTGAACTC GCGGCTGATG TTGGTTTGGA CTGAAACGGT TGGTGGCAGA
CTGGGTAATG GAATGCGCTT CAGCAACACG ATCGTTTATA ATACATTCCC GGTCCCTTCC
CTCACTGACC AGAACAAGGC CGACCTCACC CGCTGCGCGG AGGACATCCT CCTCGCCCGA
GAGTCGCATT TCCCGGCTAC GATTGCGGAC CTCTATGATC CCGAGACCAT GCCCGAAAGC
CTGCGCGCCG CGCACGATCG CAACGACGAA GTCCTCGAAC GCATCTACAT CGGCCGCCGC
TTCCGCAACG ACACCGAACG CCTCGAAAAG CTATTCGAAC TCTACACCAA AATGACTGGC
GGACGATCCT CAGAAGGTGG AGCGGCATGA
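
The gene sequence is displayed in space-separated blocks of 10 bases, so it has to be joined before any calculation such as the GC content reported above. The following is a minimal sketch under that assumption; the helper names are hypothetical, and only the first displayed row is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First displayed row of the gene sequence (six blocks of 10 bases).
first_row = "ATGAACCCCG TAGAGATTGA AGAAGCCGTC TCGGACCTCG CTCGCGCACC CTATGACGCT"
seq = clean_sequence(first_row)
print(len(seq))          # prints 60
print(gc_percent(seq))   # GC percentage of this row only, not of the whole gene
```

Passing the full gene sequence through the same two functions gives a value that can be compared with the GC content field.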
 
Protein sequence
MNPVEIEEAV SDLARAPYDA SEFPFQFLAA FGNKQTTLQR LRAGNSNQSD LPGAVLQRNH 
IHIATCDAGN VDRTLAALRK SPKTASQKAR FILATDGVAF QAEDMASGET VACNYAAFPD
KFAFFLPLAG ITTVQQIRES SFDIKATGRL NKLYVELLKD NPDWASRSED MNHFMARLIF
CFFAEDTDIF VGEGLFSRTV ETMSARDASD THMVIAEIFR AMDTRLADRA AAGIKSWADV
FPYVNGQLFS GSTECPRFSK IARSYLLHIG SLDWQKINPD IFGSMIQAVA DDEERGALGM
HYTSVPNILK VLNPLFLDDL RAKLEEAGDN SRKLLNLRNR MAKIRVFDPA CGSGNFLVIA
YKQMRELEAE INRRRGEADR RSDIPLTNFR GIELRNFPAE IARLALIIAE YQCDVLYRGQ
KEALAEFLPL DSQNWITCGN ALRLDWLSIC PPTGTAVKLQ ANDLFEMPLD QAEIDFENEG
GETYICGNPP YLGAKKKSSD QIEDMKRVGL DKAQLLDYVS AFIVRGLPLV AQQRCDMALV
STSSICQGEQ VSLIWPRILK SANVKFAYRP FRWSNSAANN AGVYCTIIGL TGSEVSNKKL
FGEGSVVECS SIAPYLVPGP EIICAPRQSS ISGFARMVMG SNPVDGKRLI FEQDEKESVV
AADPRSERFF KRYGGTQELV NGVDRWCLWI NDDQVDDAKA IAEIAKVLES CRSYRQGAGR
DAQKAANRPH SFCYRTFQEN IGIHVGLTIG NGLSHVPADL KSSGFVSSHT AYMIYGWHPV
EFALLNSRLM LVWTETVGGR LGNGMRFSNT IVYNTFPVPS LTDQNKADLT RCAEDILLAR
ESHFPATIAD LYDPETMPES LRAAHDRNDE VLERIYIGRR FRNDTERLEK LFELYTKMTG
GRSSEGGAA