Gene Saro_0350 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_0350 
Symbol 
ID3918234 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp376858 
End bp378921 
Gene Length2064 bp 
Protein Length687 aa 
Translation table11 
GC content65% 
IMG OID640443079 
ProductMcrBC 5-methylcytosine restriction system component-like 
Protein accessionYP_495632 
Protein GI87198375 
COG category[V] Defense mechanisms 
COG ID[COG4268] McrBC 5-methylcytosine restriction system component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTTGCCG AGATGGCCTA TTCGCGGGCC GACGAAGCAG CGGGTGCGTC TAACCCGAGT 
GCAATTGATC TCAGCGGATC GTTCTACGCC AATCTTTCCG CATGCCGTGC AGCTGCCCGG
GCCTATTTCC GCTTCTGCAA TTCTGAGGCT CGTCCACAGG GCAGGTTTGG CGAGCTCGAC
CGGGAGGCAG TTCTCGAGGC GGTGGCCGCA TGTGACGCCG TTGGCGACGT CGCCCAGTAT
GTCGCTGATC TCGATCTCGG GCAGCCGACC AGATACTGGC TCGTTCTGGA CGGCAAGCGC
TACCCCAGCA AAGCCGTCGT ACGCGATGCT TTGGCGAGAC GCGGCAGTGA CTGGCTGCCT
GGCGGCGGTG AATGCAAGAC CGCGCTCGAG CGCCTCGGCT TTGTCGTCAT CGACTGGCCC
GAACTCAACC GCGCTCGCGA TGCATTCCTG CGCCAGATGC CGGATTTCAG CGATTTCCGT
GCCGCTGCCG GCGCCTACTG GGATGTGGAA CGCGCCTACA AGAACGGACT GATCGAGCAG
GCCAAGGCAA TCATTGCACG GCAGGATGAC GACCGCGCGG TTGGCGAAAG CCTCTACCGG
CTGCTTTCGG TGGGCGGTTC GGGCCTGCCG CTGAGCTGGC GGACACTGTC CGAAGTCCAG
AACGCCGATC CCGAGCTGCG CGACCGTTTC TACACCTCGC TGGGGGTGCT TGCTCGCAGC
GATGGCCCGC TCGAAGAAGC TGTCCCGGCC GCGGCGCGCG AGCTCGAAGC CCTGCGTGAA
GCGGGCATTG CAGGGCTGCG CCGGGGCGAG GTGCTCTCCA TTCCGATTAC CGTCTGGGCC
ACTTTGCACC CCGACCAGGC GAGTTGGTTC AAGATCGCCA AGATCGACGA GATGGGGCGG
CGGTTGTTCG GTCGCAGGCT GTTCCCGCAA ACCGAGTTCC GCGACGCTGA TCTTGCCGAA
TGGCTGCAGT TGATGCGGGC GCTGCTCGGG TTGCTCGATA AGGAATTCGG GTGGCACCCG
CATGACCTGT TCGATGTGCA GGGGTTCATC TGGGTGGTCG GCAATCCGGA TTCCCCCCGC
GAACTCGATC CGGTGCCCGT CTGGATGGTG ACCTCGCTGT GGGGGCAAGA GGACGGCTTG
CCGCGCTTCG TCGAGCGGGC AGAGTGGAGT TTGCTCACTG ACACCGGCAG CGCGAACAAC
CGCCGCGTCC GCGAGATGCA GGTCGGTGAC CGGATCTTCC TCAAGGATTT CGTGCCGCGC
GCCCGCGATC TGCCCTTCGA TGCCGGAACG GGGATCATGG CGGCGGCGAC CGTGTTCCTG
GCCCGTCACA CCCGCTCGCT CGCCACCCGG CGCACGCTTG ACGAGTTGAG ACATGCTCTG
GCCGATATCC CGCTGATGCC AATCACGCGG CTGCCGTGGC AAGCGGTGCG GATTGATCGC
ACCAACCGGC GCTGGGAGGC GCTGTTCCGG CTCGCCCGCC TGCTGCTTCA GCGCGACTGG
CAGGCTACTC ACCATCACGC CAAGGCCCCT GATGGTCTGA CCCTGCTTTT TCCAATGAAC
GACCTGTTCG AGAAATACAT CGCTGTGCTG CTTCGCCGGG CGCTGGCGGG GAGCGGGATC
GAGGTGATCG ACCAGGGCGG CCACCGCGCC TGCCTTGGCT CCTTTACTGG CGGGCATCTC
GAGACCGGCG AGGTGTTCCG CACCAAACCT GACATCATGT TGCGCCGTGG TCGCGAAATT
GTGGCCATCA TCGATACCAA GTGGAAGAAG CTCAGCCTCG ACCCGCTCGA CCGCAAGCAC
GGGGTTAGCC AGGCTGATGT CTATCAGCTC ATGGCCTATG CGCGGCTCTA CCAGACGGCC
GAGCTGATGC TACTTTACCC GGCGCGACCG GGGCAGGTGT GCGCAGAGCG CGCACAGTTC
GGCATGGCGG GCGGGAGCGA GCGCCTCAGA ATCGCGATGG CTGACGTCTC GCTGGACGAG
AAGGCTCTGG CAGAGGCTCT CGGAGTGCTG GTGATGGCGC CCGCCGTCAC CAAGGCTTCG
CCATTGCCGC AGGCGGTGGG GTAG
 
Protein sequence
MLAEMAYSRA DEAAGASNPS AIDLSGSFYA NLSACRAAAR AYFRFCNSEA RPQGRFGELD 
REAVLEAVAA CDAVGDVAQY VADLDLGQPT RYWLVLDGKR YPSKAVVRDA LARRGSDWLP
GGGECKTALE RLGFVVIDWP ELNRARDAFL RQMPDFSDFR AAAGAYWDVE RAYKNGLIEQ
AKAIIARQDD DRAVGESLYR LLSVGGSGLP LSWRTLSEVQ NADPELRDRF YTSLGVLARS
DGPLEEAVPA AARELEALRE AGIAGLRRGE VLSIPITVWA TLHPDQASWF KIAKIDEMGR
RLFGRRLFPQ TEFRDADLAE WLQLMRALLG LLDKEFGWHP HDLFDVQGFI WVVGNPDSPR
ELDPVPVWMV TSLWGQEDGL PRFVERAEWS LLTDTGSANN RRVREMQVGD RIFLKDFVPR
ARDLPFDAGT GIMAAATVFL ARHTRSLATR RTLDELRHAL ADIPLMPITR LPWQAVRIDR
TNRRWEALFR LARLLLQRDW QATHHHAKAP DGLTLLFPMN DLFEKYIAVL LRRALAGSGI
EVIDQGGHRA CLGSFTGGHL ETGEVFRTKP DIMLRRGREI VAIIDTKWKK LSLDPLDRKH
GVSQADVYQL MAYARLYQTA ELMLLYPARP GQVCAERAQF GMAGGSERLR IAMADVSLDE
KALAEALGVL VMAPAVTKAS PLPQAVG