Gene Sde_0552 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSde_0552 
Symbol 
ID3967895 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharophagus degradans 2-40 
KingdomBacteria 
Replicon accessionNC_007912 
Strand
Start bp673294 
End bp674265 
Gene Length972 bp 
Protein Length323 aa 
Translation table11 
GC content53% 
IMG OID637919615 
Productprolyl aminopeptidase 
Protein accessionYP_526028 
Protein GI90020201 
COG category[R] General function prediction only 
COG ID[COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)  
TIGRFAM ID[TIGR01249] proline iminopeptidase, Neisseria-type subfamily 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.429543 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGATTT TATTTCCGGA AATTAAGCCC TACGCCACCC ACGAGCTAGC CGTTGATGAC 
GTGCATACGC TCTACGTAGA GGAAAGTGGC GACCCCGGCG GTATTCCGGT GCTGTTTGTA
CACGGTGGGC CAGGGGCAGG CTGCAGCAAG CATGACCGCC GCTTTTTTAA CCCCGAGCTG
TACCGCATTA TTTTGTTTGA TCAACGCGGC GCTGGCCGCT CTAAACCGCA TGCCGAATTG
GAGCACAACA CCAGCCAACA CCTAGTGGAG GATATGGAAA AGATTCGTGA ATTTCTCTCC
GTTGATAAAT GGGTACTTTT CGGCGGGTCG TGGGGCTCTA CCCTTAGCTT ACTGTATGCG
CAGGCTTACC CACAAAACGT GTTGTATATG ATTTTGCGCG GTATCTTTTT GTGCAGAGAG
CAAGACTTAC AGTGGTTTTA TCAAGCGGGA GCTGACCGCA TTTTTCCTGA CTACTGGCAG
GATTACCTCG CCCCTATCGC CGAGAATGAA CGCGACGACA TGATAGGTGC GTACTATAAA
AAACTTACCG GCTCTAACGA GCTGGCTAAA ATGTCTGCCG CTAAGGCTTG GTCACAATGG
GAAGGCCGCT GCGCTACCCT GCGCCCCAAC CCCGATGTAG TAGACCGCTT TACCGACCCC
CATATGGCCG TTTCACTGGC GCGTATAGAA GCTCACTACT TTGTAAATTG CGGCTTTATG
AGCCCCAACC AAATTATTAA TAACGCGCAG ACATTAGCGG GCATTCCCGC CACAATTATT
CACGGCCGCT ACGATATGGT GTGCCCGCTA GACAACGCCT TTGCCCTTGC GGAAGCTTGG
CCCACGGCCA AATTACATAT TATTCGCGAC GCCGGCCACT CTTCATCTGA GCCCAGCGTA
GTAGATGCGT TGGTACGCGT TACCCACGAC GTAGCCCAAG AGCTTTCTGG CGATGGCGAC
GAAACGAGTT GA
 
Protein sequence
MQILFPEIKP YATHELAVDD VHTLYVEESG DPGGIPVLFV HGGPGAGCSK HDRRFFNPEL 
YRIILFDQRG AGRSKPHAEL EHNTSQHLVE DMEKIREFLS VDKWVLFGGS WGSTLSLLYA
QAYPQNVLYM ILRGIFLCRE QDLQWFYQAG ADRIFPDYWQ DYLAPIAENE RDDMIGAYYK
KLTGSNELAK MSAAKAWSQW EGRCATLRPN PDVVDRFTDP HMAVSLARIE AHYFVNCGFM
SPNQIINNAQ TLAGIPATII HGRYDMVCPL DNAFALAEAW PTAKLHIIRD AGHSSSEPSV
VDALVRVTHD VAQELSGDGD ETS