Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sbal195_0021 |
Symbol | |
ID | 5751713 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella baltica OS195 |
Kingdom | Bacteria |
Replicon accession | NC_009997 |
Strand | + |
Start bp | 25841 |
End bp | 27163 |
Gene Length | 1323 bp |
Protein Length | 440 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 641286262 |
Product | proline dipeptidase |
Protein accession | YP_001552463 |
Protein GI | 160873147 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0006] Xaa-Pro aminopeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.215078 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.0114173 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATCAGT TGGCTCATCA CTATCGTGCC CATATTGCCG AGTTAAACCG TCGAGTCGCA GAGATTTTGT CTCGAGAAGC CTTGTCTGGT TTAGTGATCC ATTCGGGTCA GCCGCATCGG ATGTTTTTGG ATGATATCAA TTATCCCTTT AAGGCAAACC CGCACTTCAA GGCATGGTTG CCTGTGTTGG ATAATCCGAA TTGTTGGTTA GTGGTCAACG GCCGTGATAA GCCGCAGCTG ATTTTTTATC GTCCTGTGGA TTTTTGGCAC AAAGTGTCTG ATGTGCCTGA TATGTTTTGG ACTGAGTATT TCGATATTAA GCTGCTGACC AAGGCTGATA AGGTCGCTGA GTTTTTACCG ACAGATATCG CTAATTGGGC CTATTTAGGT GAGCATTTAG ATGTGGCCGA AGTGCTGGGT TTTACCAGTC GTAATCCCGA TGCTGTGATG AGTTATCTGC ATTACCATAG AACGACTAAA ACCGAATATG AGCTGGAATG CATGCGCCGC GCGAATCAAA TTGCGGTGCA GGGACATTTG GCGGCTAAAA ATGCCTTTTA TAATGGTGCG AGCGAGTTCG AAATCCAGCA GCACTATTTA TCTGCCGTAG GCCAGAGCGA AAATGAAGTG CCCTATGGCA ATATCATCGC TCTTAACCAA AATGCGGCGA TTTTGCATTA CACCGCACTT GAACACCAAA GCCCTGCGAA ACGTTTGTCA TTTCTTATCG ATGCCGGCGC GAGTTACTTT GGCTATGCCT CTGATATCAC CAGAACTTAT GCATTCGAGA AGAATCGTTT CGATGAGTTG ATCACTGCAA TGAACAAGGC GCAGCTAGAG CTTATCGACA TGATGCGTCC GGGTGTGCGT TATCCCGATT TACACTTGGC CACCCATGCT AAAGTCGCGC AAATGCTATT GGATTTTGAT TTAGCCACAG GTGATGTCCA AGGTTTGATT GATCAAGGCA TAACCAGTGC TTTCTTCCCC CATGGCTTAG GTCACATGTT AGGCCTACAA GTGCATGATG TTGGCGGCTT CTCCCACGAT GAACGCGGAA CTCATATTGC GGCGCCAGAG GCCCATCCAT TCCTACGTTG CACCCGCATT TTAGCGCCAA ACCAAGTGCT GACCATGGAA CCTGGGTTAT ACATTATCGA TACTCTGCTC AATGAGCTTA AACAAGATAG TCGTGGCCAA CAGATCAACT GGCAAACGGT TGATGAGTTA AGACCTTTTG GCGGTATTCG CATCGAAGAT AACGTCATAG TGCATCAAGA TAGAAACGAG AACATGACCC GTGAACTCGG TTTGACCGAT TGA
|
Protein sequence | MDQLAHHYRA HIAELNRRVA EILSREALSG LVIHSGQPHR MFLDDINYPF KANPHFKAWL PVLDNPNCWL VVNGRDKPQL IFYRPVDFWH KVSDVPDMFW TEYFDIKLLT KADKVAEFLP TDIANWAYLG EHLDVAEVLG FTSRNPDAVM SYLHYHRTTK TEYELECMRR ANQIAVQGHL AAKNAFYNGA SEFEIQQHYL SAVGQSENEV PYGNIIALNQ NAAILHYTAL EHQSPAKRLS FLIDAGASYF GYASDITRTY AFEKNRFDEL ITAMNKAQLE LIDMMRPGVR YPDLHLATHA KVAQMLLDFD LATGDVQGLI DQGITSAFFP HGLGHMLGLQ VHDVGGFSHD ERGTHIAAPE AHPFLRCTRI LAPNQVLTME PGLYIIDTLL NELKQDSRGQ QINWQTVDEL RPFGGIRIED NVIVHQDRNE NMTRELGLTD
|
| |