Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_0290 |
Symbol | |
ID | 3916227 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 312028 |
End bp | 313980 |
Gene Length | 1953 bp |
Protein Length | 650 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640443019 |
Product | glycyl aminopeptidase |
Protein accession | YP_495572 |
Protein GI | 87198315 |
COG category | [R] General function prediction only |
COG ID | [COG3975] Predicted protease with the C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTCAAGA CGCTCGCGGC AGCACCGCTC CTTCTCGCCC TTTCCGTTTC AACTCAGGCT CTTGCGGCGA ACTCTGCGCC GATGGCCGTG CCGATCGCGC AGACCGTGCC GGACGCGCAG GACGTGGCCT ATCCGGGCAC AATGACGCTC GACATCGACG CGTCGGACGT CGTGCGCGGG GCCTATCGCG TGACCCAGAC CATTCCGGTG GTGGCGGGCG CGAAGGAGCT GATCCTGCTT TACCCGCAGT GGCTGCCCGG CAACCACGGG CCGCGCGGGC CGCTGGCCGA ACTCGTCGGC GTGCAGTTCT TCGTCGACGG CAAGCCGGTG GAATGGAAGC GCGACCGGGT CGAGGTCTAT GCCTTCCACG TCACCCTGCC GGCCGGCGCG AAGGAGGTCG TGGCCAAGCT GATCCACACC TCGCCGCTGC AATCTTCGGA AGGGCGCATT ACGATGACAC CCGAGATGCT CAACCTGCAG TGGGAGAAGA TGAGCCTCTA TCCCGCCGGT CACTATGTCC GCCGCATTCG CGTGAAGCCG ACCGTAACCC TGCCGCAGGG CTGGACGCCC GCCACCGCGC TCGACGGAAT GAGCATGAGC GGCAACCGCG CGACCTGGGC CGAAACCGAC TACGAGACGC TGGTCGATTC GCCGATCTTC GCGGGCAAGA ACTTCCGCAA GTGGGACCTC GGGCAGAACG TCACGCTCAA CGTCGTCTCC GACAAGCCGG AGCAGCTCGA GGCCAAGCCC GAGCATATCG CCGCGCACAA GGCGCTGGTC GAGGAAGCGC GGATCGCGTT CGGCGCGAAC CATTTCGACC ACTACGAGTT CCTGCTGGCG CTGTCGGACA AGATCGGCGG CATCGGGCTG GAACATCACC GGTCGAGCGA GAACCAGCTC GAACCCGAGG CCTTCACCGA ATGGGCCAAG CAGGAATGGG ACCGCAATCT GCTGCCGCAC GAGTATTCGC ACTCGTGGTC GGGCAAGTTC CGCCGTCCGT CGCGCCTGTG GACGCCCGAC TATCGCCAGC CGATGCAGGG CGACCTGCTG TGGACCTACG AAGGGCAGGA CCAGTTCTGG GGCGCGGTGC TGGCCAGCCG TTCGGGCATG CAGGGCAAGG ACATGGTGCT GGGCATGCTG GCGGCATGGG CGGGCGGCTT CACCCAGCAG CCGGGCCGCG AATGGCGTTC GGTCGAGGAT ACCGGGTTCG ACCCGGTCTT CGCCTCGCGC AAGCCGAAGC CCTACTCGTC GCTGGCCCGC AACGAGGACT ACTACACCGA AGGCGCGCTG GTCTGGCTGG AAATCGACCA GATCCTTCGC GAAGGCACCG GCGGCAAGAA GTCCATCGAC GACTTCGCCA AGTCGTTCTT CGGCATGAAT CCGGGGGACT GGGGCCAGAT CCCGTTCGAG GTGGACGAGA TCGTCACCAA GCTGAACGCG CTTTATCCCT ATGACTGGGC CAAGCTGATC GACACCCGCA TCAACCAGCC GGGCCAGCCC GCGCCGCTGA ACGGGATCGA GAAGGGCGGC TACAAGCTGG TCTGGAAGGA AGAGCCCAAT CCCTACATGA AGGCGGCGAT CGATTTCGGC AAAGGCCTGA GCCTTTCCAA CTCGATCGGC ATTTCGCTCG ACAAGGACGG CAAGGTCACC GGCACGCGCT GGGACAGCCC GGCCTTCAAT GCGGGAATCG TGACCGGTAC GCAGATCATG GCGGTGAACG GCACCGCCTA TAGCGCGGAT GACCTCAAGA AGGCGATCAC CGCAGCCAAG GGTGACAAGG GCCAGCCGCT CGAACTGCTG GTCAAGCGCG GCAGCCGGTT CGAGACCGTG AAGCTCGATT ACCGGGATGG CCTGCGTTAT CCGTGGCTCG AGCGCGTGGC GCCGGGCAAG GCGCCGACCG GGCTCGACCT GCTGCTCGAA CCCCGGCGCC CCGGCGCGGC GAAGAAGAAG TAA
|
Protein sequence | MFKTLAAAPL LLALSVSTQA LAANSAPMAV PIAQTVPDAQ DVAYPGTMTL DIDASDVVRG AYRVTQTIPV VAGAKELILL YPQWLPGNHG PRGPLAELVG VQFFVDGKPV EWKRDRVEVY AFHVTLPAGA KEVVAKLIHT SPLQSSEGRI TMTPEMLNLQ WEKMSLYPAG HYVRRIRVKP TVTLPQGWTP ATALDGMSMS GNRATWAETD YETLVDSPIF AGKNFRKWDL GQNVTLNVVS DKPEQLEAKP EHIAAHKALV EEARIAFGAN HFDHYEFLLA LSDKIGGIGL EHHRSSENQL EPEAFTEWAK QEWDRNLLPH EYSHSWSGKF RRPSRLWTPD YRQPMQGDLL WTYEGQDQFW GAVLASRSGM QGKDMVLGML AAWAGGFTQQ PGREWRSVED TGFDPVFASR KPKPYSSLAR NEDYYTEGAL VWLEIDQILR EGTGGKKSID DFAKSFFGMN PGDWGQIPFE VDEIVTKLNA LYPYDWAKLI DTRINQPGQP APLNGIEKGG YKLVWKEEPN PYMKAAIDFG KGLSLSNSIG ISLDKDGKVT GTRWDSPAFN AGIVTGTQIM AVNGTAYSAD DLKKAITAAK GDKGQPLELL VKRGSRFETV KLDYRDGLRY PWLERVAPGK APTGLDLLLE PRRPGAAKKK
|
| |