Gene Information |
Locus tag | Saro_0990 |
Symbol | |
ID | 3915772 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 1031323 |
End bp | 1034208 |
Gene Length | 2886 bp |
Protein Length | 961 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640443724 |
Product | peptidase M16-like |
Protein accession | YP_496269 |
Protein GI | 87199012 |
COG category | [R] General function prediction only |
COG ID | [COG0612] Predicted Zn-dependent peptidases |
TIGRFAM ID | |
|
|
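The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration. It assumes that Start bp and End bp are 1-based inclusive positions, and that Protein Length excludes the stop codon. Both assumptions are consistent with the listed values but are not stated by the record. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1031323, 1034208)
print(length)                     # 2886, matching the Gene Length row
print(protein_length_aa(length))  # 961, matching the Protein Length row
```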
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGAATTC GTACCACGCG CGCCCGGCGC ATAGGCCTGC TTCTCCCGCT CCTGCTGCTG ACTGCCGCGC CGATTGCGGC GCGCGCACCG GAAAGCGCCG GCGTTCCCTG GCTGTACAAG GGGTCGGACG TTCCGCAGGA CAAGGCCTGG ACTTTCGGCG TCCTGCCCAA CGGCATACGC TATGCCGTGC GCCACAACGG CGTCCCGCCC GAGCAGGTCT CCATTCGCGT CCTCGTCGAT GCCGGCTCGA TGTACGAGAC CGAAAGCCAG CGCGGCTATG CGCACCTGAT CGAGCACCTG ACCTTCCGTG AATCGAAATA CCTCAAGGAA GGCGAGGCGA TTCCGACCTG GCAGCGCCTT GGCGCGACCT TCGGCAGCGA CACGAACGCC GAGACCAGCC CGACGCAGAC CGTCTACAAG CTCGACATCC CGAACGCGAC CGATGCCAAG CTGGACGAGA CGTTCAGGCT CCTGTCCGGA ATGATTACCG CGCCGATCTT CACCGACCAC GGCGTGAAGA CCGAAGTCCC CATCGTGCTT GCCGAAATGC GCGAGCGCAC GAGCCCGCAA TCGCGCGTGC TGGACGAGAC GCGTGGCCTG TTCTTCAAGG GGCAGCTTCT TGCCTCGCGC AATCCCATCG GCACGGTGCA GACGCTCGAG GCGGCGAACG CGGCCGCGGT CAAGGCATTC CACGACAAGT GGTACCGGCC CGACAACACC GTGATCGTGG TCGCCGGCGA TGCCGATCCC GCTGCTCTGG TGGCACGGAT CAAGCAGTCG TTCGGCGGCT GGAAGGCCAC CGGCAAGAAG CCGCTTCAGC CGGATTTCGG CAAGCCTCTG GCGCCTGCCG GTGCGGACCC GAAGAATCCG GTTGGCGAGG CAAAGGTGCT CGTCGAACCC GATCTTCCGC GCATCATCAA CTGGGCGATC CTGCGTCCAT GGGTCAAGGT CAACGACACG ATCCAGTACA ACCAAGGGCT GATGATAGAC CGGCTGGCGC TGGCCTTGAT CAATCGCCGG CTTGAAGCCC GCGCCCGGGG CGGCGGCAGC TATCTGGTCG CATCGGTTGA CGAGATGAAG CAGGAACTGT CACGCTCGGC CGATGCTACG GTCGTGACCG TGACACCGCT TGGCGAGGAC TGGAAGGGCG CGGTCAAGGA CGTGCGAGCG GTCATTGCCG ATGCCCTTGC CACGCCCCCC TCGCAGGAGG AGATCGACCG CGAGGTCGCC GAGTTCGAGG TGGCCTTCAA GGTCTCGGTC GAGACGCAGA CAACAATCGC CGGATCGAAG GCGGCGGACG ACATCGTCAA TGCTGTCGAC ATCCGCGAGA CAGTCGCCAA TCCCGACACG GTCTACGACA TCTTCAAGCG GTCGATCCCG CTGTTCAGGC CCCAGGCGGT GCTCGATCAC ACGCGCGGCC TGTTCAAGGG GACGGTCGTC CGTCCGCTCA TGATCACGCC GAAGGCCGGG GAGGCCGACG AGGCCTCGCT TCGAGCAGCG CTGACGGCCC CGGTCGATGC CGCATCGGGA AGCCGCGTCG CGGCAAACGG CCTCAAGTTC TCGGACCTGC CGGCTGTCGG CGTGCCCGGC ACGGTCGCCA CGGCGCGGCC GATCGGCTTG CTCGGCATCG AGCAGATAGA ACTGTCTAAT GGGGTCAAGG TCCTGCTCTG GCCGAACGAC GCCGAGCCGG GCCGGATCAT CATCAAGGCG CGCTTCGGTG GCGGCTATTC GGCCATCTCC CCGCAGGACG CGGTCTATGG GCCGCTCGCG GAGGTGGCGC TGATGGACAG CGGCATCGGC 
GAACTCGGGC GCGACGATCT CGATCGCCTG GCCACCGGCC GCAAGCTCAG CCTCGATTTC GACATCGACG ATACTGCGTT CGTCATGTCC TCGGACACGC GCCCGGCGGA CCTTCAGGAC CAGCTCTATC TCATGGCGGC CAAGCTTGCG ATGCCGCGCT GGGACCCGAA CCCGGTGCTG CGCGCCAAGG CGGCGGCGAA GCTCCAGTAC GAAAGCTACA ACTCCGCCCC GATGGCCGTG CTCAACCGCG ACCTGACGTG GCTGCTGCGC GATGGCGATC CGCGTTATGC CACGCCCAAC CCGGCGGAGC TTGACCGGGC GACCCCCGAA GGCTTCCGCA AGACCTGGGA GCAACTCCTT GCGCAGGGGC CGATCGAGAT CGACATGTTC GGCGATTTCA CGCGCGAACA GGCGCTTGCG GCGCTTGAAA AGACCTTTGG CGCGCTTCCG GCGCGACAGC CGGCGCCTGC CTCCACGCTG GCGCCGTCGA TCCCGGCGCA CAATGCCGAA CCCCTTGTGC TGACGCATCG CGGCGATCCC TCGCAGGCTG CCGCTGTCGT TGCATGGCCC ACGGGCGGGG GGCAGGCGGG GGTCCGCGAG AGCCGCCAGC TCGAGATTCT CGCCCAGATC TTCAACAATC GCCTGTTCGA CGCGATGCGC GAGAAGGTGG GGGCGAGCTA CGCGCCGCAA GTGGGATCGA GCTGGCCGCT TGATCTGCCC TCGGGCGGCT ATATCGCGGC GACCGTGCAA GTGCGTCCGG GCGATTTCGA GACCTTCTTT GCCGCGGCCG ACAAGATCGC CGCCGACCTC GTCGCCACGC CTCCGACTGC CGACGAGATC GCGCGCGTTA CCGAACCGCT CAAGCAGCTC ATCACCCGCG CCAGCACCGG CAACGGCTTC TACATGTTCC AGCTTGAAGG GGCCGCCAAC GATCCGCGCA AGATCGCCGC GATCCGGACC ATCCTCAACG ACTACAGCCA GACCACGCCT GAACGGATGC AGGCGCTGGC CGAGCGTTAT CTGCGCAAGG ACAAGAGCTG GCGGCTCGAG GTGGTGCCGG AAAAGGCGGG GCAGGCGACG CCCTGA
|
Protein sequence | MRIRTTRARR IGLLLPLLLL TAAPIAARAP ESAGVPWLYK GSDVPQDKAW TFGVLPNGIR YAVRHNGVPP EQVSIRVLVD AGSMYETESQ RGYAHLIEHL TFRESKYLKE GEAIPTWQRL GATFGSDTNA ETSPTQTVYK LDIPNATDAK LDETFRLLSG MITAPIFTDH GVKTEVPIVL AEMRERTSPQ SRVLDETRGL FFKGQLLASR NPIGTVQTLE AANAAAVKAF HDKWYRPDNT VIVVAGDADP AALVARIKQS FGGWKATGKK PLQPDFGKPL APAGADPKNP VGEAKVLVEP DLPRIINWAI LRPWVKVNDT IQYNQGLMID RLALALINRR LEARARGGGS YLVASVDEMK QELSRSADAT VVTVTPLGED WKGAVKDVRA VIADALATPP SQEEIDREVA EFEVAFKVSV ETQTTIAGSK AADDIVNAVD IRETVANPDT VYDIFKRSIP LFRPQAVLDH TRGLFKGTVV RPLMITPKAG EADEASLRAA LTAPVDAASG SRVAANGLKF SDLPAVGVPG TVATARPIGL LGIEQIELSN GVKVLLWPND AEPGRIIIKA RFGGGYSAIS PQDAVYGPLA EVALMDSGIG ELGRDDLDRL ATGRKLSLDF DIDDTAFVMS SDTRPADLQD QLYLMAAKLA MPRWDPNPVL RAKAAAKLQY ESYNSAPMAV LNRDLTWLLR DGDPRYATPN PAELDRATPE GFRKTWEQLL AQGPIEIDMF GDFTREQALA ALEKTFGALP ARQPAPASTL APSIPAHNAE PLVLTHRGDP SQAAAVVAWP TGGGQAGVRE SRQLEILAQI FNNRLFDAMR EKVGASYAPQ VGSSWPLDLP SGGYIAATVQ VRPGDFETFF AAADKIAADL VATPPTADEI ARVTEPLKQL ITRASTGNGF YMFQLEGAAN DPRKIAAIRT ILNDYSQTTP ERMQALAERY LRKDKSWRLE VVPEKAGQAT P
|
| |
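The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any downstream use. The following is a minimal sketch of that cleanup with two sanity checks (GC fraction and a rough test for a complete coding sequence). The helper names are hypothetical. The start codon set lists only the common bacterial starts, and translation table 11 permits a few more.

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace that groups residues into blocks of ten."""
    return "".join(raw.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


def looks_like_complete_cds(dna: str) -> bool:
    """Rough check: whole codons, a common start codon, and a stop codon."""
    starts = {"ATG", "GTG", "TTG"}  # assumption: common starts only
    stops = {"TAA", "TAG", "TGA"}   # stop codons under translation table 11
    return len(dna) % 3 == 0 and dna[:3] in starts and dna[-3:] in stops


# First two blocks of the gene sequence above, used only as a short excerpt.
excerpt = clean_sequence("ATGCGAATTC GTACCACGCG")
print(len(excerpt))  # 20
print(gc_fraction(excerpt))
```

If the record is internally consistent, running the same helpers on the full gene sequence should give a length equal to the Gene Length row and a GC fraction close to the GC content row.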