Gene SO_2047 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSO_2047 
Symbol 
ID1169796 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella oneidensis MR-1 
KingdomBacteria 
Replicon accessionNC_004347 
Strand
Start bp2145107 
End bp2147200 
Gene Length2094 bp 
Protein Length697 aa 
Translation table11 
GC content47% 
IMG OID637343918 
Productprolyl oligopeptidase family protein 
Protein accessionNP_717650 
Protein GI24373607 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1505] Serine proteases of the peptidase family S9A 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATCGTA AATTACTGCC TTGGTGCATT GCTGGTGTAC TCACTATGAC TGCTCAAGTC 
CACGCACAAG AAGACAAGTA TATTTGGCTC GAAGAGGTTG AAGGCGCAAA GCCTATGGAA
TGGGTTAAGA CCCAAAATGC CGCTTCTGCC GCTGAAATTA AAGCCTTTAA AGGTTTCGAT
ACCTTAGTCG CCAACAGCCT CGCTATCCTT AACGATAAAG AGCGTATTCC CTACGCCACC
CATATTGGCG ATAAGCTGTA TAACTTCTGG AAGGACGATA CCCATGTGCG GGGGATTTAC
CGTCGCACGA CGATGGAAGA ATACGCTAAG GTAGATCCCA AGTGGGAAAC TGTGCTCGAT
GTCGATGCTT TGGGTAAGTC GGAATCCGTA AACTGGGTGT TTAAGAGCAT TGATTGCCAA
TATCCCGATA ATCAGCGCTG CTTTGTGTCT TTATCCCGTG GCGGCGCCGA TGCGGTGGAA
GTGCGTGAAT TTGATTTAAC CACTAAAGAT TTTGTGCCAG TAAAAGAGAA GCCATTCTTT
TTAAAAGAAG CGAAATCTAG CCTAAGTTGG ATAGATAAAG ATCATGCATT TGTCGGCACC
GACTTTGGCG ATGGCCAAAG CATGACTGAC TCAGGTTACC CCCGCGTCGT GAAGTTATGG
CAACGTGGTA CCCCACTTAC CAACGCTAAG ACGATTTTTA GTGGAGATAA AACCTCTGTC
GCGGTATCGG GTTGGGTGAT GTTTGACGAT AAAACCCCGC TCAGCCTTGT CACAGAAGCA
CATACCTTTT ATACCGCGAC GCAGTATGTT TACCAAGACG GCAAACTGAT TAAGCTACCC
ATCCCACAAG ATGCTGAGAT TAAAGGTTAT TTCCAAGGTA AGTTGTTTAT TGAGCTAAAG
AGTGACTTAG CAACGCCTGC GGCGACATTC AGCCAAGGCG CTGTGGTGTA CGCTAATGTG
GCCGATTTAA TTGCCCAAAA AGCCTCCTTC ACTGAATTTG TGAGCCCAAC CCCGACAGCA
TCTATTGCGC AGCTAAGTTT CAGTAAGAGC GCGATTTTTG TTAATTGGCT CGATAACGTA
AAAAGCAAAT TGGTGCGTTA TGAGCAAGAT GACAAGGGCA CATGGCAAAG CATGCCAGTC
CCCTTTGAGG CCAATGGTGC GCTAACGGTG ATGGATATGG AGCGCGACAG TGATGATTTC
TTTGTTAATT ACACTAGCTT CCTTGAGCCA TCGAGCCTGT ATACCGTCAA TGCTAAGGCA
CTTAAACCTC AAAAAATCAA AGGTATGCCT CAGCAGTTTG CTGCGGATAA ATTTAAAACC
AAGCAATATT TTGCCACCTC AAAGGATGGC ACCAAAGTGC CTTATTTTGT GGTTATGGCA
AAAGATCTCA AGTTAGATGG CCGTAATCCA ACATTGCTCT ATGGTTATGG CGGATTTGAA
GTGTCTCTTC GCCCAGCATA TTCAGCCACC ATTGGTAAAA ACTGGTTAGA ACAGGGTGGA
GTCTATGTGC TGGCCAATAT CCGTGGCGGT GGAGAATATG GCCCAGCGTG GCATCAAGCG
GCGCTAAAAC AGAATCGTCA TAAAGCTTAT GAAGACTTTG AGGCCATTGC TGAGGATTTG
ATCGCCCGTA AGATCACCTC GAGCAAGCAT TTAGGTATTC AAGGCGGCAG TAACGGTGGT
TTGCTGATGG GCGCAGCCTT TACCCGTCGT CCCGATCTCT ACAATGCCGT CGTGTGCCAA
GTGCCATTAC TCGATATGTA CCGCTTTAAC AAGTTACTCG CGGGGGCCAG TTGGATGGGG
GAATACGGTA ATCCCGATAT TCCACAGGAG TGGGACTATA TTAAAACCTA CTCGCCATAC
CATAACCTGC ATAAAGATGT GCATTATCCA AAGGTCTTTT TCACTACCTC AACTCGTGAC
GATAGGGTGC ATCCTGGGCA CGCCCGTAAG ATGGTTGCTA AGATGAAGGA TATGGGCATT
GATGTGCTTT ACTACGAAAA TATCGAAGGC GGGCATGCGG GGGCAGCAGA TAATAATCAA
GCTGCGGAAC TAAATTCAAT GGCGTTTGCC TACTTATTAC AGCAGTTAAA ATAA
 
Protein sequence
MNRKLLPWCI AGVLTMTAQV HAQEDKYIWL EEVEGAKPME WVKTQNAASA AEIKAFKGFD 
TLVANSLAIL NDKERIPYAT HIGDKLYNFW KDDTHVRGIY RRTTMEEYAK VDPKWETVLD
VDALGKSESV NWVFKSIDCQ YPDNQRCFVS LSRGGADAVE VREFDLTTKD FVPVKEKPFF
LKEAKSSLSW IDKDHAFVGT DFGDGQSMTD SGYPRVVKLW QRGTPLTNAK TIFSGDKTSV
AVSGWVMFDD KTPLSLVTEA HTFYTATQYV YQDGKLIKLP IPQDAEIKGY FQGKLFIELK
SDLATPAATF SQGAVVYANV ADLIAQKASF TEFVSPTPTA SIAQLSFSKS AIFVNWLDNV
KSKLVRYEQD DKGTWQSMPV PFEANGALTV MDMERDSDDF FVNYTSFLEP SSLYTVNAKA
LKPQKIKGMP QQFAADKFKT KQYFATSKDG TKVPYFVVMA KDLKLDGRNP TLLYGYGGFE
VSLRPAYSAT IGKNWLEQGG VYVLANIRGG GEYGPAWHQA ALKQNRHKAY EDFEAIAEDL
IARKITSSKH LGIQGGSNGG LLMGAAFTRR PDLYNAVVCQ VPLLDMYRFN KLLAGASWMG
EYGNPDIPQE WDYIKTYSPY HNLHKDVHYP KVFFTTSTRD DRVHPGHARK MVAKMKDMGI
DVLYYENIEG GHAGAADNNQ AAELNSMAFA YLLQQLK