Gene Saro_1038 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1038 
Symbol 
ID3915820 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp1076575 
End bp1079421 
Gene Length2847 bp 
Protein Length948 aa 
Translation table11 
GC content69% 
IMG OID640443772 
Productpeptidase M16-like 
Protein accessionYP_496317 
Protein GI87199060 
COG category[R] General function prediction only 
COG ID[COG0612] Predicted Zn-dependent peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0872842 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCCTGC GCCGCCTGCT TCCCCTCGCC GCTGCCCTGT CCCTCGCCGC CTGCGCCGCG 
CAGCCCGCGC AACTCGCGAG CGCACCCAAA GAAACCTGGG CCTTCCAGCG CAGCGACTTG
CCGCCCGATC CCGCCTTTCG CTACGGCCAG CTGCCCAACG GCATGCGCTT CATCATCCGC
AAGAACGCCA CGCCCGCAGG GACCGCGCAA GTGCGCATGG ATGTCGCCAC CGGCTCGCTC
GACGAGCGCG AGAGCGAGCG CGGCTTTGCC CATTTCGTCG AACACATGGC CTTCAACGGC
TCCACCCGCG TGCCCGAAGG CGAGATGGTC AAACTGCTGG AACGCAACGG CCTTTCCTTC
GGGGCGGATA CCAACGCGCA GACCTCGTTC GAGCAGACGC TCTACATGCT GGACCTGCCG
AGGAACGACG CGAAGCTTCT CGACACCGCG CTCATGCTCA TGCGCGAGAC GGCAAGCGAA
CTGACATTCG ACCCCGAAGC AGTCACCCGC GAACGCGGCG TGGTCCTGTC CGAACTGCGC
GACGGACAGG GCTGGCAGCG CACCAATCTC GAAGACCAGC TCGCCTTCTT CTATCCCGCC
GCCACCTACC CCCGGCGCTT GCCCATCGGC ACTGTCGAGG CCCTCAACGC CGCGACGGCG
GATACGCTCA GGGCCTTCTG GTCACGCGAA TACGTGCCCT CGAAAACCAC GCTCGTCATC
GTCGGAGACT TCGATCCCGA CGTTGTGGAG CAGGCGATCC GCACCCGCTT CGCCGACTGG
CAGCCCCAGG CCGAAACCCC GCGCCCCGAC CAGGGCAAGG TTCTTACGAA GCAGAAGGGC
GCGGTCGACA TCCACCTCGA TCCATCGCTC TCCGAACGCG TCACCGCCTC GCGCCACGGG
CCTTGGCTGG ACGAGCCCGA CACGATCGCC AACCGCCGCC GCAATCTCCT GCGCCAGATC
GGCTACGGCG TGGTCAACCG CCGCTTCCAG CGCATGAGCC GGACGATCGA CCCGCCGTTC
CGCGGCGCGG GGCTGGGAAC GAGCGAGGTG TTCAGGATCG GCCGCACCAC CAATCTCATC
GTCGACACCG TCGACGGCGG ATGGCAGCGC GGTTTCGCCG CCGCCGCCGC CGCCTATGCC
CGGGCGCTTG CCACGGGCTT CACCCAGGTC GAGATCGACG AGCAGGTCGC CAACATCCGC
ACCGGGCTGG AGAATGCGGC CGCAGGCGCC GATACGCGCC CGCACGGCAC GCTGGTCAAC
GCCGCCCTCG CCCTCGTGCG CGACGAGCAG GTGCCCACGA CCCCGCAGTC CGGACTCGAC
CGCTTCAACC GCTTCGCCGC CACGATCACC CCGCAAACGG TAATGGCCGC GCTCAAGGAA
GAAGCGGTTC CCCTCAAGGC CCCGCTGATC CGCTTCCAGG GCCGAACCGC GCCCAAGGGC
GGGGCCGAAG CGCTGCGCAA GACCTGGGAC AAGGCCACCC GCGCCAGGGC AGCCGCTGGC
GAAATCCCCG CACCGACGGC CTTTGCCTAT AACGATTTCG GTCCCGCCGG CGCGGTCGTC
TCAGACACGG TCGAACCGCT CTACGCCATC CGCCAGATCC GCTTCGCCAA CAATGTCCGC
CTCAACCTCA AGCGTACCGA CCTCGCGCGG GACCGGGTCG AGGTCCGCCT CAATCTCGAC
GGCGGCGAAA TGCTCGATAC CCCCGCCCAA CCCCTCGCCA CCGAGATGAC CGGCGTTCTC
GCGCGCGGCG GCCTCGGCAA GCACAGCGAG GACGACCTCC AGACGCTGCT GGCCGGACGT
TCGGTAGTCA TGGGCCTCGG CCCTGGCGGC GACACCTTCG GCAGCGACGC GGTAACGACC
CCGCGCGACC TCCAGCTCCA GCTCCAGCTC TGGGCCGCGC TTCTCACCGA CCCCGGCTAC
CGCCCGGAAG GCGAGGTCCT CTATCGCCAG AACATCGCCA ACTTCTTCGC CCGCTTGCGC
TCTTCCCCGG GTGCGGCGCT GTCCAATGCA ATCGGCGGCA TCCTTTCCGA CAACGACCCG
CGCTTCACGC TCCAGCCCGA AAGCGCCTAC ACCGCGCTGA CATATGCAAA GCTGCGGGAA
GCCATCGCGG ACCGGCTCAC CCACGGCGCG ATAGAGGTCG CCATCGTCGG CGACATCGAC
GAGGCAGCCG CGATAGACGC CGTTGCCCGC ACCTTCGGCG CCCTTCCCCC GCGCGAGGCA
GACTTCCGCG CCTACAGCGC CGAACGCATG CGCGGCTTCA CGAGCAAGCG CGGCCCGGTC
ATCGTGCGCC ACACCGGCGA GGCCAACCAG GCGCTCGTCC GCTATGTCTG GCCCACCCGC
GACGATCGCG ATCCGGAAGA AGCCATGGCG CTCTCCCTGC TCAAGGAAGT GGCAGAAGTC
GAAGTGCTCG ACACGATCCG GGAAAAGCTC GGCAAGGCCT ATTCGCCCGG CGCTGCCAGC
AGCCTCAGCC ACGTCTGGCC GGGCTATGGA ACGTTCGTCC TCGCGGCTTC CGTCGATCTG
GCGGACGTTG CAGCCACCCG CACCGCGCTC GACCAGACGG TCCGCGCGCT GGCCGCCGCA
CCGGTCGATG CGGACGTGCT CCAGCGTGCC CGCGCACCCA TGCTCGAACG CATCGACAAC
GCGCTCAAGA CCAACGGGGG ATGGATGGCC CTTGCCGAAC GCGCCCAGAC CGAACCCGAA
CGCCTCGCCC GCGCAAGGTC CGCCCGCGCA CGGCTGGAAC GGCTGACGGC GGTAGACCTC
CAGGCCCTTG CCCGCCGCTA CCTCTCGCCG GACAAGGCGG TGCAGGTACT TGTCCTGCCC
GACGGCGCGC CCGCGCCGGA AAAGTGA
 
Protein sequence
MILRRLLPLA AALSLAACAA QPAQLASAPK ETWAFQRSDL PPDPAFRYGQ LPNGMRFIIR 
KNATPAGTAQ VRMDVATGSL DERESERGFA HFVEHMAFNG STRVPEGEMV KLLERNGLSF
GADTNAQTSF EQTLYMLDLP RNDAKLLDTA LMLMRETASE LTFDPEAVTR ERGVVLSELR
DGQGWQRTNL EDQLAFFYPA ATYPRRLPIG TVEALNAATA DTLRAFWSRE YVPSKTTLVI
VGDFDPDVVE QAIRTRFADW QPQAETPRPD QGKVLTKQKG AVDIHLDPSL SERVTASRHG
PWLDEPDTIA NRRRNLLRQI GYGVVNRRFQ RMSRTIDPPF RGAGLGTSEV FRIGRTTNLI
VDTVDGGWQR GFAAAAAAYA RALATGFTQV EIDEQVANIR TGLENAAAGA DTRPHGTLVN
AALALVRDEQ VPTTPQSGLD RFNRFAATIT PQTVMAALKE EAVPLKAPLI RFQGRTAPKG
GAEALRKTWD KATRARAAAG EIPAPTAFAY NDFGPAGAVV SDTVEPLYAI RQIRFANNVR
LNLKRTDLAR DRVEVRLNLD GGEMLDTPAQ PLATEMTGVL ARGGLGKHSE DDLQTLLAGR
SVVMGLGPGG DTFGSDAVTT PRDLQLQLQL WAALLTDPGY RPEGEVLYRQ NIANFFARLR
SSPGAALSNA IGGILSDNDP RFTLQPESAY TALTYAKLRE AIADRLTHGA IEVAIVGDID
EAAAIDAVAR TFGALPPREA DFRAYSAERM RGFTSKRGPV IVRHTGEANQ ALVRYVWPTR
DDRDPEEAMA LSLLKEVAEV EVLDTIREKL GKAYSPGAAS SLSHVWPGYG TFVLAASVDL
ADVAATRTAL DQTVRALAAA PVDADVLQRA RAPMLERIDN ALKTNGGWMA LAERAQTEPE
RLARARSARA RLERLTAVDL QALARRYLSP DKAVQVLVLP DGAPAPEK