Gene Sama_1757 details


Gene Information

Locus tag: Sama_1757
Symbol:
ID: 4604007
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella amazonensis SB2B
Kingdom: Bacteria
Replicon accession: NC_008700
Strand:
Start bp: 2148076
End bp: 2150166
Gene Length: 2091 bp
Protein Length: 696 aa
Translation table: 11
GC content: 52%
IMG OID: 639781121
Product: prolyl oligopeptidase
Protein accession: YP_927632
Protein GI: 119774892
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1505] Serine proteases of the peptidase family S9A
TIGRFAM ID:
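
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the numbers listed) and that the CDS is complete and ends in a single stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the Gene Information fields.
length = gene_length_bp(2148076, 2150166)
print(length)                     # 2091
print(protein_length_aa(length))  # 696
```

Both results agree with the listed Gene Length (2091 bp) and Protein Length (696 aa).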


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0107679
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATAAAC GGATACTGCT TCTTGGCGCA GCCTCATTGC TGGCTTTCAA TGCTGTAGCA 
CAAGGGGACC CCTTCATTTG GCTGGAAGAT GTCGAGGGTG AAAAAGCGCT TGCCTGGGTC
AAAACCCAAA ACGATCGCTC TCTTGCAGAG CTTAAAGCCG TCCCAGGTTA TAACGCGTTG
GTGGACAATG CACTGGATAT CCTTAATGAC AAGGCGCGTA TCCCGTACGC GAGTCGCATT
GGTGACCATC TCTATAACTT CTGGAAAGAC GAAGCCAATC CCCGCGGCAT TTACCGACGC
ACCACCATGG CGGAATACGT TAAAGACGCG CCTAAATGGG AAACCGTGCT CGATATCGAT
GCACTTGGTA AATCCGAAGG GGTGAACTGG GTATTCAAGG GCATGGACTG TCAGTATCCA
AAGAATATAC GTTGCCTGGT GTCCCTTTCC CGAGGCGGTG CCGATGCAGT GGAAATCCGC
GAATTTGACC TCTCAACCTT AAGTTTCGTG CCTGCCGACA AGCAGGGTTT TTTCTTGCCG
GAGGCCAAGT CCAGCACAAG TTGGATTGAT GAAAACACCC TGTTTGTGGG TACCGATTTC
GACGAAGGTG ACGCCTGGAC TGACTCTGGC TACCCGCGCA AGGTAAAGGT TTGGAAGCGG
GGCACTGATC TGAAAGACGC CAAAGAAATT TATGCAGGCA ACAAGGCATC GGTGGCGGCA
TCCGGTTACG TCATGTGGGA TGATAAAACG CCACTGCAAT TGGTATCTGA AGCCGAAACC
TTCTACGAAG CCAGTTACAA GGCGCTCCTT GATGGCAAGC TGGTAGAGCT GCCGCTTCCC
AAAGATGCCG AGCTCAAGGG CTTCTTCAAG GGCGATATTT TTGTTGAGCT TAAGAGTAAG
CTCGAGCAGG GCAACAACAG TTTCCCACAG GGCGCCATTG TCTACACCAA GGCAGACAAG
CTGCTGGCCG GAACGCCTGA GTTCGCACTC TTTGTAAAAC CCGACGCCAA TTCTTCCATC
AGTCAGGTGA CCTTTAGCCG CAACGCGGTA CTGGTGAACT GGCTCGAAGA CGTGAAGAGC
AAATTGGTGC GTTATCATAA AGATGCCAAA GGCCAGTGGC AGGGCGAAGA CGTTGGTTTC
CCAAGCAATG GCAGCATCAG TGTGTTCGAC AGCAGCCGCG ACAGAGATGA TCTGTTCGTA
ACCTATACCA GCTTCCTCGA ACCTTCGACC CTGTACAGCG TTAACGCCGA AACCCTCAAG
CGAGACTCGC TCAAGGCGAT GCCTGCCCAG TTTGATGCGT CAAAGTTTGA GGCCAAGCAG
TACTTTGCCA CCAGTAAAGA CGGTACCAAG GTGCCTTACT TTGCGGTGAT GGCCAAGGAC
ATCAAACTCG ACAGCACCAA CCCAACCCTG CTGTATGGCT ACGGTGGTTT TGAGGTATCG
CTGCGTCCTT TCTATTCTGC AACCACAGGT AAAAACTGGC TGGAGCAGGG CGGCGTTTAT
GTACTTGCCA ACATTCGCGG CGGCGGTGAA TACGGTCCCG GCTGGCACCA GGCTGCATTG
AAGGAAAACC GTCATAAAGC CTACGAAGAC TTTGAAGCCA TTGCTGAAGA TCTGATTAAG
CGCAAAATCA CTTCCCCGAA GCATCTGGGT ATTCAGGGTG GCAGTAATGG TGGCCTGCTG
ATGGGGGCTG CCTTTACCCG CAGACCCGAT TTGTACAATG CCGTGGTATG TCAGGTACCG
CTTCTGGATA TGAAGCGTTA CAACAAACTG TTGGCAGGCG CCAGCTGGAT GGGCGAGTAC
GGCAACCCGG ATATCGCCGC CGAATGGGAT TACATTAAAA CCTTCTCCCC ATACCATAAC
CTTAAAAAAG ACGTCGAATA TCCCAAAGTA TTTTTCACCA CATCCACCCG CGATGACCGT
GTGCACCCAG GCCATGCCCG TAAAATGGTG GCCAAGATGG AAGACATGGG CATTGAGGTG
CTGTACTTCG AAAACATGGA AGGCGGCCAT GCCGGCGCGG CAGATAATAA ACAAACTGCG
GAACTAAATA GCCTGGCATA TGCCTACTTA TTGAAGCAGC TTAAGTTGTA A
 
Protein sequence
MNKRILLLGA ASLLAFNAVA QGDPFIWLED VEGEKALAWV KTQNDRSLAE LKAVPGYNAL 
VDNALDILND KARIPYASRI GDHLYNFWKD EANPRGIYRR TTMAEYVKDA PKWETVLDID
ALGKSEGVNW VFKGMDCQYP KNIRCLVSLS RGGADAVEIR EFDLSTLSFV PADKQGFFLP
EAKSSTSWID ENTLFVGTDF DEGDAWTDSG YPRKVKVWKR GTDLKDAKEI YAGNKASVAA
SGYVMWDDKT PLQLVSEAET FYEASYKALL DGKLVELPLP KDAELKGFFK GDIFVELKSK
LEQGNNSFPQ GAIVYTKADK LLAGTPEFAL FVKPDANSSI SQVTFSRNAV LVNWLEDVKS
KLVRYHKDAK GQWQGEDVGF PSNGSISVFD SSRDRDDLFV TYTSFLEPST LYSVNAETLK
RDSLKAMPAQ FDASKFEAKQ YFATSKDGTK VPYFAVMAKD IKLDSTNPTL LYGYGGFEVS
LRPFYSATTG KNWLEQGGVY VLANIRGGGE YGPGWHQAAL KENRHKAYED FEAIAEDLIK
RKITSPKHLG IQGGSNGGLL MGAAFTRRPD LYNAVVCQVP LLDMKRYNKL LAGASWMGEY
GNPDIAAEWD YIKTFSPYHN LKKDVEYPKV FFTTSTRDDR VHPGHARKMV AKMEDMGIEV
LYFENMEGGH AGAADNKQTA ELNSLAYAYL LKQLKL
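
The relationship between the two sequences can be illustrated with a short translation sketch. It assumes the gene sequence is pasted exactly as shown, in space-separated blocks of ten bases, and it uses the standard codon assignments, which translation table 11 shares for internal codons (table 11 differs mainly in the start codons it permits, and this gene begins with ATG). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Drop the spaces and line breaks used to lay out the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate an in-frame DNA string and strip any trailing stop codon."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))
    return protein.rstrip("*")


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases, comparable to the GC content field."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence, copied from above.
print(translate("ATGAATAAAC GGATACTGCT TCTTGGCGCA"))  # MNKRILLLGA
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (after removing its spaces) is one way to confirm that the two listings agree.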