Gene Information |
Locus tag | Saro_1038 |
Symbol | |
ID | 3915820 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | - |
Start bp | 1076575 |
End bp | 1079421 |
Gene Length | 2847 bp |
Protein Length | 948 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640443772 |
Product | peptidase M16-like |
Protein accession | YP_496317 |
Protein GI | 87199060 |
COG category | [R] General function prediction only |
COG ID | [COG0612] Predicted Zn-dependent peptidases |
TIGRFAM ID | |
|
|
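The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (an assumption, although it agrees with the listed gene length) and that the gene length includes one terminal stop codon. The function names are hypothetical and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS length is a multiple of three and, by default,
    that the final codon is a stop codon that is not translated.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record for Saro_1038.
length_bp = gene_length(1076575, 1079421)
protein_aa = expected_protein_length(length_bp)
print(length_bp, protein_aa)
```

With the values from this record, the computed lengths agree with the Gene Length (2847 bp) and Protein Length (948 aa) fields.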
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0872842 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCCTGC GCCGCCTGCT TCCCCTCGCC GCTGCCCTGT CCCTCGCCGC CTGCGCCGCG CAGCCCGCGC AACTCGCGAG CGCACCCAAA GAAACCTGGG CCTTCCAGCG CAGCGACTTG CCGCCCGATC CCGCCTTTCG CTACGGCCAG CTGCCCAACG GCATGCGCTT CATCATCCGC AAGAACGCCA CGCCCGCAGG GACCGCGCAA GTGCGCATGG ATGTCGCCAC CGGCTCGCTC GACGAGCGCG AGAGCGAGCG CGGCTTTGCC CATTTCGTCG AACACATGGC CTTCAACGGC TCCACCCGCG TGCCCGAAGG CGAGATGGTC AAACTGCTGG AACGCAACGG CCTTTCCTTC GGGGCGGATA CCAACGCGCA GACCTCGTTC GAGCAGACGC TCTACATGCT GGACCTGCCG AGGAACGACG CGAAGCTTCT CGACACCGCG CTCATGCTCA TGCGCGAGAC GGCAAGCGAA CTGACATTCG ACCCCGAAGC AGTCACCCGC GAACGCGGCG TGGTCCTGTC CGAACTGCGC GACGGACAGG GCTGGCAGCG CACCAATCTC GAAGACCAGC TCGCCTTCTT CTATCCCGCC GCCACCTACC CCCGGCGCTT GCCCATCGGC ACTGTCGAGG CCCTCAACGC CGCGACGGCG GATACGCTCA GGGCCTTCTG GTCACGCGAA TACGTGCCCT CGAAAACCAC GCTCGTCATC GTCGGAGACT TCGATCCCGA CGTTGTGGAG CAGGCGATCC GCACCCGCTT CGCCGACTGG CAGCCCCAGG CCGAAACCCC GCGCCCCGAC CAGGGCAAGG TTCTTACGAA GCAGAAGGGC GCGGTCGACA TCCACCTCGA TCCATCGCTC TCCGAACGCG TCACCGCCTC GCGCCACGGG CCTTGGCTGG ACGAGCCCGA CACGATCGCC AACCGCCGCC GCAATCTCCT GCGCCAGATC GGCTACGGCG TGGTCAACCG CCGCTTCCAG CGCATGAGCC GGACGATCGA CCCGCCGTTC CGCGGCGCGG GGCTGGGAAC GAGCGAGGTG TTCAGGATCG GCCGCACCAC CAATCTCATC GTCGACACCG TCGACGGCGG ATGGCAGCGC GGTTTCGCCG CCGCCGCCGC CGCCTATGCC CGGGCGCTTG CCACGGGCTT CACCCAGGTC GAGATCGACG AGCAGGTCGC CAACATCCGC ACCGGGCTGG AGAATGCGGC CGCAGGCGCC GATACGCGCC CGCACGGCAC GCTGGTCAAC GCCGCCCTCG CCCTCGTGCG CGACGAGCAG GTGCCCACGA CCCCGCAGTC CGGACTCGAC CGCTTCAACC GCTTCGCCGC CACGATCACC CCGCAAACGG TAATGGCCGC GCTCAAGGAA GAAGCGGTTC CCCTCAAGGC CCCGCTGATC CGCTTCCAGG GCCGAACCGC GCCCAAGGGC GGGGCCGAAG CGCTGCGCAA GACCTGGGAC AAGGCCACCC GCGCCAGGGC AGCCGCTGGC GAAATCCCCG CACCGACGGC CTTTGCCTAT AACGATTTCG GTCCCGCCGG CGCGGTCGTC TCAGACACGG TCGAACCGCT CTACGCCATC CGCCAGATCC GCTTCGCCAA CAATGTCCGC CTCAACCTCA AGCGTACCGA CCTCGCGCGG GACCGGGTCG AGGTCCGCCT CAATCTCGAC GGCGGCGAAA TGCTCGATAC CCCCGCCCAA CCCCTCGCCA CCGAGATGAC CGGCGTTCTC GCGCGCGGCG GCCTCGGCAA GCACAGCGAG GACGACCTCC AGACGCTGCT GGCCGGACGT 
TCGGTAGTCA TGGGCCTCGG CCCTGGCGGC GACACCTTCG GCAGCGACGC GGTAACGACC CCGCGCGACC TCCAGCTCCA GCTCCAGCTC TGGGCCGCGC TTCTCACCGA CCCCGGCTAC CGCCCGGAAG GCGAGGTCCT CTATCGCCAG AACATCGCCA ACTTCTTCGC CCGCTTGCGC TCTTCCCCGG GTGCGGCGCT GTCCAATGCA ATCGGCGGCA TCCTTTCCGA CAACGACCCG CGCTTCACGC TCCAGCCCGA AAGCGCCTAC ACCGCGCTGA CATATGCAAA GCTGCGGGAA GCCATCGCGG ACCGGCTCAC CCACGGCGCG ATAGAGGTCG CCATCGTCGG CGACATCGAC GAGGCAGCCG CGATAGACGC CGTTGCCCGC ACCTTCGGCG CCCTTCCCCC GCGCGAGGCA GACTTCCGCG CCTACAGCGC CGAACGCATG CGCGGCTTCA CGAGCAAGCG CGGCCCGGTC ATCGTGCGCC ACACCGGCGA GGCCAACCAG GCGCTCGTCC GCTATGTCTG GCCCACCCGC GACGATCGCG ATCCGGAAGA AGCCATGGCG CTCTCCCTGC TCAAGGAAGT GGCAGAAGTC GAAGTGCTCG ACACGATCCG GGAAAAGCTC GGCAAGGCCT ATTCGCCCGG CGCTGCCAGC AGCCTCAGCC ACGTCTGGCC GGGCTATGGA ACGTTCGTCC TCGCGGCTTC CGTCGATCTG GCGGACGTTG CAGCCACCCG CACCGCGCTC GACCAGACGG TCCGCGCGCT GGCCGCCGCA CCGGTCGATG CGGACGTGCT CCAGCGTGCC CGCGCACCCA TGCTCGAACG CATCGACAAC GCGCTCAAGA CCAACGGGGG ATGGATGGCC CTTGCCGAAC GCGCCCAGAC CGAACCCGAA CGCCTCGCCC GCGCAAGGTC CGCCCGCGCA CGGCTGGAAC GGCTGACGGC GGTAGACCTC CAGGCCCTTG CCCGCCGCTA CCTCTCGCCG GACAAGGCGG TGCAGGTACT TGTCCTGCCC GACGGCGCGC CCGCGCCGGA AAAGTGA
|
Protein sequence | MILRRLLPLA AALSLAACAA QPAQLASAPK ETWAFQRSDL PPDPAFRYGQ LPNGMRFIIR KNATPAGTAQ VRMDVATGSL DERESERGFA HFVEHMAFNG STRVPEGEMV KLLERNGLSF GADTNAQTSF EQTLYMLDLP RNDAKLLDTA LMLMRETASE LTFDPEAVTR ERGVVLSELR DGQGWQRTNL EDQLAFFYPA ATYPRRLPIG TVEALNAATA DTLRAFWSRE YVPSKTTLVI VGDFDPDVVE QAIRTRFADW QPQAETPRPD QGKVLTKQKG AVDIHLDPSL SERVTASRHG PWLDEPDTIA NRRRNLLRQI GYGVVNRRFQ RMSRTIDPPF RGAGLGTSEV FRIGRTTNLI VDTVDGGWQR GFAAAAAAYA RALATGFTQV EIDEQVANIR TGLENAAAGA DTRPHGTLVN AALALVRDEQ VPTTPQSGLD RFNRFAATIT PQTVMAALKE EAVPLKAPLI RFQGRTAPKG GAEALRKTWD KATRARAAAG EIPAPTAFAY NDFGPAGAVV SDTVEPLYAI RQIRFANNVR LNLKRTDLAR DRVEVRLNLD GGEMLDTPAQ PLATEMTGVL ARGGLGKHSE DDLQTLLAGR SVVMGLGPGG DTFGSDAVTT PRDLQLQLQL WAALLTDPGY RPEGEVLYRQ NIANFFARLR SSPGAALSNA IGGILSDNDP RFTLQPESAY TALTYAKLRE AIADRLTHGA IEVAIVGDID EAAAIDAVAR TFGALPPREA DFRAYSAERM RGFTSKRGPV IVRHTGEANQ ALVRYVWPTR DDRDPEEAMA LSLLKEVAEV EVLDTIREKL GKAYSPGAAS SLSHVWPGYG TFVLAASVDL ADVAATRTAL DQTVRALAAA PVDADVLQRA RAPMLERIDN ALKTNGGWMA LAERAQTEPE RLARARSARA RLERLTAVDL QALARRYLSP DKAVQVLVLP DGAPAPEK
|
| |
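The relationship between the gene sequence, the protein sequence and the GC content field can be sketched in a few lines. This is an illustrative example only, not the method used by the database. It assumes the gene sequence is listed in coding orientation (it begins with ATG even though the gene lies on the minus strand), and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code. Alternative start codons, which table 11 permits, are not handled here. The helper names are hypothetical.

```python
from itertools import product

# Standard codon-to-amino-acid mapping in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same assignments for elongation codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate a coding sequence, dropping one trailing stop codon if present."""
    s = clean(seq)
    if len(s) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    protein = "".join(CODON_TABLE[s[i:i + 3]] for i in range(0, len(s), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First 30 bases and last 27 bases of the gene sequence in this record.
print(translate("ATGATCCTGC GCCGCCTGCT TCCCCTCGCC"))
print(translate("GACGGCGCGC CCGCGCCGGA AAAGTGA"))
```

Applied to the first and last blocks of the gene sequence, the sketch gives the first ten residues (MILRRLLPLA) and the last eight residues (DGAPAPEK) of the listed protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare with the GC content field.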