Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sbal223_2990 |
Symbol | |
ID | 7089058 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella baltica OS223 |
Kingdom | Bacteria |
Replicon accession | NC_011663 |
Strand | - |
Start bp | 3531708 |
End bp | 3534542 |
Gene Length | 2835 bp |
Protein Length | 944 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 643461875 |
Product | peptidase S9 prolyl oligopeptidase active site domain protein |
Protein accession | YP_002358899 |
Protein GI | 217974148 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00257397 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00000788258 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | TTGAAGTTAA TCCCTATCAC ATCTCTGGTG CTGTTATCCC TTGGAATATT GCCAGCAGTG CAGGCTGAAA CCAGCACTAA ACCGCTCACT CTCACAGACA TCATGCACTT TGAATCCCTT GAAAAGCCGG TTATCGCAGA TAAGGGCCAA GTCATCGCGG TCGAGTCTGC GCCCGATCGC GGGGATAGTC ACGTCATTGT GAAAAATGTG CTGTCGCAGC AAAGTTATCA AATCGCGGGA GGGTCAGATC CTTTAGTGAG TCACGATGGC CGCTTTGTGG CTGTGGTGGT CAAACCGAGT TTGCTCACGC GTGAAACGAG TGATGCCAAG GCGAAGAAAA AACTCAAATC TGACATGGTA TTACTCGATA CCCAAATGGG CACGCAAACC CGCTTTGAGC GCGTGAAGGA ATTCGTGTTC AGTGATGACG GTAAGCATTT GGCAATGTGG TTTGAGGCCG ATGAAGAGTC TAAAAAAGAT GCTGAGCCTA AAGCCATCGA AAAACTCGCT GAAGCAGGCA CACAAACTCA TGCTAAAGCC GATAAGCCAA AGGTTGATAA GTTTGACCAA GGCCGCCGTT TTAGTTTGAT CAGTTTAGAC GATCAAACTA AGCGCATCGA TGTTGAGCAA GTGACTGGTT ATGTGTTTGA TAAGGCGAGC CGCCGACTTG CATTAGCGGT GAATGATATC GCCAATAAAC AGCATCAATT GCAATTAGTG GATTTACATA CCCACCAAAA AACCGTCGTT TTTGATTCGC CTAGCCAACA GGTTGGTGCG TTAGCGCTGG CTAAAAATGG TCGTTGGCTG GCGTTTACGC AGGGCAAAGA TAGCGAGTTG CCCTATGGTC GCAGCTATCA GTTATCGCTG GTCGATTTGC AATCGGGTAA AATCAGCAAA ACTCCTGAGT CACCCGAGTG GAAACTCAAT CGCTACACAA GTTTAAGCTT CTCATTGGAC AGCGAACGGC TGTTCTTCGG CCGCGTGCCA GAGGTGAGCC AGCAGCTTAG TTTGCAGAAA ATCACCGAAG AAAAAGACCT GTTCGATGCC GATATTGTTA CCGGACAGCG CGGGCTTAAG GTGTGGCATG GTGACGACCC ACGGATTAAA CCCCACGAAA TCAAGCAATA TGAGGATGAG CAAAAACGCA CTTACCTTGC GGTATTGCAT TTAGGCTCAA ATAACGTGGT GCAGCTTGGC GATAAAACCG TGCCAGATGT GACCATTTCA CAACATAAGC GCTTTATGTT GGCGAGTTCA GATTTGCCCT ATCGCAAGAT GGCGACGTGG GCTGGCTTTT ATCAAGACTA TTATCTGGTC GACATTAACA CTGGCAGTAA GGTGCCTTTT TTGACTCAGC AGCCGAGCGA TGCCGAGCCA AGTTTGTCGG GTAACGGCAA GTATGTAGCT TATTATCAGC AAGGTAATGT GTATCTTTAT GATATTGCTG AGGCCCATCG CACTAACCTG ACAAGATCAT TAAAGGTCAG CTTTGCCGAT GAAGACCATG ATTATCCATC GAGCGCGCCG GGATACGGTT TTGGCCCTTG GCTGAAAGAT GACGCTGGGT TCTTGGTTTA CGATAAATAC GATGTCTGGC AGTTTAATAC TGAATCTAAG GCTGGTTTTG CCTTAACCGC AGGTAAAGGG CGCACCCAGA AAATTCAATA TCGTCTTGAG GGCTTAGTCG ATAATCCCGA TGAACCGACT GAGCTTGCCT ATAACGCCAC TGTGTTACTC CACGGTTATA GCGATAAAAC TAAGGCCGAT GGCTTCTACC AAGCGACGCT CGGCGCAGCG GGCGTGAAAA CGCTGATGCA AGGCGAGTAT AAGTTGACTG TGTTAGGACG CAGTAAAGAC GCAGACACTA TCGTGTTTTC TAAGGAGCGT TTCGACTTAT TCCCCGACTT GTATACCGCT AACTATAGCG CCCCGCAAAA TGCAGTTAAG CAAACGGATT TAGATAAGCA ACGCCAAGCG TTTAACTGGA GCAAAGCCGA GTTAGTTCAC TGGACGAATG GCGATGGTAA GCCGTTAGAT GGCGTGCTGA TCAAGCCGAC CAATTATCAA GCGGGTCAGC GTGTTCCTGT GTTGGTTTAT TACTATCGTT TTATGACGGA TAGGCTGCAT GCCTTCCCGC AGATGAACAT TAACCACAGG CCTAATTTTG CTTGGTATAT CAATAATGGC TATGCGGTGT TCCTGCCTGA TATTCGTTTT GAAATTGGTT ACCCCGGCGC GAGTTCGGTG CAAGCGCTCA CCTCAGGCGT GCAAAAGCTG ATTGAGATGG GCATTGCCGA TCCCGATGCT ATTGGTCTGC AAGGTCATTC TTGGAGTGGT TATCAAACGG CGTTTGCCAT CACTCAAACT AAGATGTTTA AGGCGGCGGT CGCGGGTGCG CCGGTATCGA ACATGACCAG CGCCTACAGT GGTATTCGCC ATGGTACGGG TATTGCGCGT CAGTTCCAGT ATGAAACTGG ACAGAGCCGT ATTGGCGCGA GTTTGTTTGC CGCGCCACAA AAGTACATTG AAAACTCGCC AGTATTCTAC GCTGACCGAA TTCAAACGCC TTTGATGATC ATGTTTGGTG ATAAGGACGA CGCAGTGCCT TGGGAACAAG GTGTTGAAAT GTACTTGGCC ATGCGCCGTG CGGGTAAAGA TGTAGTGTTC TTACAATATG AGGATGAGCC GCATCACTTG AAAAAGTACC CGAATAAGCT CGATTACAGC ATTCGCATGA TGCAGTATTT CGATCATTAT CTGAAGGGTA AACCCGCGCC AGAATGGCTC AGCAAAGGCG AAGCCTATGT CGAGTACAAG GCCGATGATG AATAA
|
Protein sequence | MKLIPITSLV LLSLGILPAV QAETSTKPLT LTDIMHFESL EKPVIADKGQ VIAVESAPDR GDSHVIVKNV LSQQSYQIAG GSDPLVSHDG RFVAVVVKPS LLTRETSDAK AKKKLKSDMV LLDTQMGTQT RFERVKEFVF SDDGKHLAMW FEADEESKKD AEPKAIEKLA EAGTQTHAKA DKPKVDKFDQ GRRFSLISLD DQTKRIDVEQ VTGYVFDKAS RRLALAVNDI ANKQHQLQLV DLHTHQKTVV FDSPSQQVGA LALAKNGRWL AFTQGKDSEL PYGRSYQLSL VDLQSGKISK TPESPEWKLN RYTSLSFSLD SERLFFGRVP EVSQQLSLQK ITEEKDLFDA DIVTGQRGLK VWHGDDPRIK PHEIKQYEDE QKRTYLAVLH LGSNNVVQLG DKTVPDVTIS QHKRFMLASS DLPYRKMATW AGFYQDYYLV DINTGSKVPF LTQQPSDAEP SLSGNGKYVA YYQQGNVYLY DIAEAHRTNL TRSLKVSFAD EDHDYPSSAP GYGFGPWLKD DAGFLVYDKY DVWQFNTESK AGFALTAGKG RTQKIQYRLE GLVDNPDEPT ELAYNATVLL HGYSDKTKAD GFYQATLGAA GVKTLMQGEY KLTVLGRSKD ADTIVFSKER FDLFPDLYTA NYSAPQNAVK QTDLDKQRQA FNWSKAELVH WTNGDGKPLD GVLIKPTNYQ AGQRVPVLVY YYRFMTDRLH AFPQMNINHR PNFAWYINNG YAVFLPDIRF EIGYPGASSV QALTSGVQKL IEMGIADPDA IGLQGHSWSG YQTAFAITQT KMFKAAVAGA PVSNMTSAYS GIRHGTGIAR QFQYETGQSR IGASLFAAPQ KYIENSPVFY ADRIQTPLMI MFGDKDDAVP WEQGVEMYLA MRRAGKDVVF LQYEDEPHHL KKYPNKLDYS IRMMQYFDHY LKGKPAPEWL SKGEAYVEYK ADDE
|
| |