Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SO_4252 |
Symbol | |
ID | 1171856 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella oneidensis MR-1 |
Kingdom | Bacteria |
Replicon accession | NC_004347 |
Strand | + |
Start bp | 4428521 |
End bp | 4430509 |
Gene Length | 1989 bp |
Protein Length | 662 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 637345978 |
Product | prolyl oligopeptidase family protein |
Protein accession | NP_719779 |
Protein GI | 24375736 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGATTT TGTCTTTTGC CGTTTTGGCT GCCATTTGCT TACCCATGAT GCCTATCACC ACATTTGCGG CTGAAACATC GGCCACGAAT GCATTATCAC AATCACAATT ATTTAGTCGA GGCGATGAAT ACGCTAACGT AAAAATATCC CCAACAGGTA AATACTTAAG TGCAATTACC AGTGTCGAAG GTAAAAATGT ACTCCTAGTA TTAGATGCTC AAACCAAAAA ACTGTTAAAT GCGATTCGCT TCCCAAGCAA TGCACAAGTG GGGACCTATG AATGGGCCAG CAGCGAACGC ATCGTGCTGG CAAAAGAATA CCTTAAGGGC TGGAGCGATG TCCCCCAATA CTACGGCGAA TTAATGGCGG TCAACGCCGA CGGTTCGCGT CCTGCCTACC TATTTGGTTT TAATAGTGGC GAGCAACAAA CGGGTTCAAA TATCAAGAAA AACACCGCCA TAAAGGCCAC AGCATTCATT CTGGACCCTC TACCGGATGA TGAACGCTAT ATGCTGGTTA ATGCCATTCC ATGGAATAAC GAGAGCAGCC TCAATCTTGA ATTAAAGCAA GATGTGTATC GAGTTGACCT TTTTAGCGGG GTTCGTAAAC GCATCACAGG CTCCCCCATT GGCCAAGCTC GCTTTATGAC CGACCACGAA GGTGAAGTCC GCTTTGTAAC TGGGGAAGAT GGCCAAAACG TCACTAAAGT GTTTTACCGA AAAGACGGCG ACTGGGTGAA TACCGATAAA CTCAACCTAG GCTTAAGTGA TTTTACGCCT ATCTCCTTCG CTGATAATAA AAATACCATC TATGCTGCGG GTCGCGTGGG CACTGAAACC TTAGGCGTCT ATCGCATCAA TCTCGAAACG GGCGAGAAGA CCGAGATAAT TCAAGATGAG GTAGTCGATC CGAGTAACTT TTGGATCAAT GGCACTAACA AACAGCTGTA TGCCGTTGAG TTTGAAAATG GCTATCCAAG CTATGCCTTT GTCGATAACA ACGATAACCA CGCCAAACTG CTTAAGGATT TAATTGCCGC CCTACCTGGG CATCAAGTAC AGATTGTTAG TGAAACCCGT AATGGCGAGC AACTGGTGGT GATTGCATTT AACGACCGCA ATCCCGGTGA TTACTATTTA TTTGATACTA AAAAGCTCAA ACTCGAGTAC CTCGCCGCCG CTCGCAAATG GCTCGACCCA GAGCAAATGG CGGAGGTTAA ACCTATTAGT TTCACCAACC GCGATGGTCA GAAAATCCAT GGCTATTTAA CCTTACCCTA CGGCAAAGAA GCCAAAAATT TACCGCTGGT CGTTAATCCC CATGGTGGTC CCCATGGTAT TCGTGACTGG TGGGGGTTTG ATCCACAAAA TCAATTACTC GCCCAGAATG GTATGGCGGT GTTGCAGGTT AACTTCCGTG GCTCAGGCGG TTATGGCGAG CGTTTCGAGC AAGCAGGTTA CCAAAAATGG GGCTCAGATA TTCAGCACGA TATTATCGAT GCGACTCAGT ATGTGATTGG TCAAGGCTTT GTCGATAAAG AACGGATTTG TATTGCGGGC GGTAGCTTTG GCGGCTATAG CGCTTTGCAA AGTGCGGTAT TAGCACCCGA TATGTTTAAA TGCGCGGTTG GGTTTGCAGG TGTGTATGAT CTTGAGTTGA TGTTTGATGA AGGCGATGTC GCCAGAACAC GTTCAGGAAC AAGCTATCTT AAGGACGTAC TTGGCCAAGA CAAAGCCACC CTAAAAGCCA TGTCTCCCTC TGAGAACGTT GCCAAATTAA AAGCGAACCT CTTACTCGTT CACGGTGGTG AAGATGAGCG CGCGCCAATT GAACAGTTGG AATCGCTTGA AAAAGCCCTT AAAAACCATA ATTACCCTTA CCAAAAACTG GTGATGGATA ACGAAGGTCA TGGTTTTTAT AACGATACTC ATAGAGCCAA GTATTACGAT CAGATGCTGA GCTTCTTAAA AACCAACTTA AAACTTTAG
|
Protein sequence | MKILSFAVLA AICLPMMPIT TFAAETSATN ALSQSQLFSR GDEYANVKIS PTGKYLSAIT SVEGKNVLLV LDAQTKKLLN AIRFPSNAQV GTYEWASSER IVLAKEYLKG WSDVPQYYGE LMAVNADGSR PAYLFGFNSG EQQTGSNIKK NTAIKATAFI LDPLPDDERY MLVNAIPWNN ESSLNLELKQ DVYRVDLFSG VRKRITGSPI GQARFMTDHE GEVRFVTGED GQNVTKVFYR KDGDWVNTDK LNLGLSDFTP ISFADNKNTI YAAGRVGTET LGVYRINLET GEKTEIIQDE VVDPSNFWIN GTNKQLYAVE FENGYPSYAF VDNNDNHAKL LKDLIAALPG HQVQIVSETR NGEQLVVIAF NDRNPGDYYL FDTKKLKLEY LAAARKWLDP EQMAEVKPIS FTNRDGQKIH GYLTLPYGKE AKNLPLVVNP HGGPHGIRDW WGFDPQNQLL AQNGMAVLQV NFRGSGGYGE RFEQAGYQKW GSDIQHDIID ATQYVIGQGF VDKERICIAG GSFGGYSALQ SAVLAPDMFK CAVGFAGVYD LELMFDEGDV ARTRSGTSYL KDVLGQDKAT LKAMSPSENV AKLKANLLLV HGGEDERAPI EQLESLEKAL KNHNYPYQKL VMDNEGHGFY NDTHRAKYYD QMLSFLKTNL KL
|
| |