Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sde_0552 |
Symbol | |
ID | 3967895 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Saccharophagus degradans 2-40 |
Kingdom | Bacteria |
Replicon accession | NC_007912 |
Strand | - |
Start bp | 673294 |
End bp | 674265 |
Gene Length | 972 bp |
Protein Length | 323 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 637919615 |
Product | prolyl aminopeptidase |
Protein accession | YP_526028 |
Protein GI | 90020201 |
COG category | [R] General function prediction only |
COG ID | [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) |
TIGRFAM ID | [TIGR01249] proline iminopeptidase, Neisseria-type subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 0.429543 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGATTT TATTTCCGGA AATTAAGCCC TACGCCACCC ACGAGCTAGC CGTTGATGAC GTGCATACGC TCTACGTAGA GGAAAGTGGC GACCCCGGCG GTATTCCGGT GCTGTTTGTA CACGGTGGGC CAGGGGCAGG CTGCAGCAAG CATGACCGCC GCTTTTTTAA CCCCGAGCTG TACCGCATTA TTTTGTTTGA TCAACGCGGC GCTGGCCGCT CTAAACCGCA TGCCGAATTG GAGCACAACA CCAGCCAACA CCTAGTGGAG GATATGGAAA AGATTCGTGA ATTTCTCTCC GTTGATAAAT GGGTACTTTT CGGCGGGTCG TGGGGCTCTA CCCTTAGCTT ACTGTATGCG CAGGCTTACC CACAAAACGT GTTGTATATG ATTTTGCGCG GTATCTTTTT GTGCAGAGAG CAAGACTTAC AGTGGTTTTA TCAAGCGGGA GCTGACCGCA TTTTTCCTGA CTACTGGCAG GATTACCTCG CCCCTATCGC CGAGAATGAA CGCGACGACA TGATAGGTGC GTACTATAAA AAACTTACCG GCTCTAACGA GCTGGCTAAA ATGTCTGCCG CTAAGGCTTG GTCACAATGG GAAGGCCGCT GCGCTACCCT GCGCCCCAAC CCCGATGTAG TAGACCGCTT TACCGACCCC CATATGGCCG TTTCACTGGC GCGTATAGAA GCTCACTACT TTGTAAATTG CGGCTTTATG AGCCCCAACC AAATTATTAA TAACGCGCAG ACATTAGCGG GCATTCCCGC CACAATTATT CACGGCCGCT ACGATATGGT GTGCCCGCTA GACAACGCCT TTGCCCTTGC GGAAGCTTGG CCCACGGCCA AATTACATAT TATTCGCGAC GCCGGCCACT CTTCATCTGA GCCCAGCGTA GTAGATGCGT TGGTACGCGT TACCCACGAC GTAGCCCAAG AGCTTTCTGG CGATGGCGAC GAAACGAGTT GA
|
Protein sequence | MQILFPEIKP YATHELAVDD VHTLYVEESG DPGGIPVLFV HGGPGAGCSK HDRRFFNPEL YRIILFDQRG AGRSKPHAEL EHNTSQHLVE DMEKIREFLS VDKWVLFGGS WGSTLSLLYA QAYPQNVLYM ILRGIFLCRE QDLQWFYQAG ADRIFPDYWQ DYLAPIAENE RDDMIGAYYK KLTGSNELAK MSAAKAWSQW EGRCATLRPN PDVVDRFTDP HMAVSLARIE AHYFVNCGFM SPNQIINNAQ TLAGIPATII HGRYDMVCPL DNAFALAEAW PTAKLHIIRD AGHSSSEPSV VDALVRVTHD VAQELSGDGD ETS
|
| |