Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sputcn32_0014 |
Symbol | |
ID | 5081489 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella putrefaciens CN-32 |
Kingdom | Bacteria |
Replicon accession | NC_009438 |
Strand | + |
Start bp | 19735 |
End bp | 21057 |
Gene Length | 1323 bp |
Protein Length | 440 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640497130 |
Product | proline dipeptidase |
Protein accession | YP_001181549 |
Protein GI | 146291125 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0006] Xaa-Pro aminopeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.258964 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATCAGT TGGCTCATCA CTATCGTGCC CATATTGCCG AGTTAAACCG CCGAGTCGCA GAGATTTTGT CTCGAGAAGC CTTGTCTGGT TTAGTGATCC ATTCGGGTCA GCCGCATCGG ATGTTTTTGG ATGATATCAA TTATCCCTTT AAAGCAAACC CGCACTTCAA GGCATGGTTG CCAGTGTTGG ATAATCCGAA TTGCTGGTTA GTCGTCAATG GTCGCGATAA GCCGCAGCTG ATTTTTTATC GTCCTGTGGA TTTTTGGCAC AAAGTATCCG ATGTGCCGGA TATGTTCTGG ACCGAGCATT TTGATATTAA GTTGTTAACT AAGGCTGATA AGGTCGCTGA ACTGTTGCCC AAAGACACTG TTAATTGGGC TTATTTGGGC GAGCATTTAG ATGTAGCCGA AGTGCTGGGT TTTACCAGTC GCAATCCCGA TGCTGTGATG AGCTATTTGC ATTACCACAG AACCACTAAA ACTGAATATG AGCTGGAATG TATGCGCCGC GCCAACCAAA TTGCGGTGCA GGGACATTTG GCGGCTAAAA ATGCGTTTTA TAACGGTGCG AGTGAGTTCG AGATCCAACA GCACTATTTA TCTGCCGTGG GCCAGAGCGA AAATGAAGTG CCCTACGGCA ATATTATCGC TCTTAATCAA AATGCGGCGA TTTTGCATTA CACCGCGCTT GAGCATCAAA GCCCCGCGAA ACGTTTGTCT TTTCTTATCG ATGCGGGCGC GAGTTACTTT GGCTATGCGT CTGATATTAC TAGAACCTAT GCCTTCGAGA AGAATCGTTT CGATGAGTTG ATCGCTGCGA TGAATAAGGC GCAGCTTGAG CTCATCGACA TGATGCGTCC CGGTGTGCGT TATCCTGATT TACACTTAGC TACTCACGCT AAAGTCGCGC AAATGCTATT GGATTTTGAT TTAGCCACGG GGGATGCCCA AGGTTTGGTT GATCAAGGCA TTACCAGTGC TTTCTTCCCC CATGGTTTAG GCCACATGTT AGGTTTACAA GTGCATGATG TGGGTGGCTT CTCCCACGAT GAGCGCGGTA CCCATATCGC GGCGCCCGAG GCCCACCCAT TTTTGCGCTG CACCCGCATT TTAGCGCCAA ACCAAGTACT GACTATGGAA CCTGGGTTAT ACATCATAGA TACTTTGCTT AATGAGCTTA AACAAGATAG TCGTGGCCTG CAGATCAACT GGCAAACCGT TGATGAGTTA AGGCCTTTTG GCGGTATTCG TATCGAAGAT AACGTCATAG TGCATCAAGA TAGAAACGAG AATATGACCC GCGAGCTCGG CTTAGCCGAT TGA
|
Protein sequence | MDQLAHHYRA HIAELNRRVA EILSREALSG LVIHSGQPHR MFLDDINYPF KANPHFKAWL PVLDNPNCWL VVNGRDKPQL IFYRPVDFWH KVSDVPDMFW TEHFDIKLLT KADKVAELLP KDTVNWAYLG EHLDVAEVLG FTSRNPDAVM SYLHYHRTTK TEYELECMRR ANQIAVQGHL AAKNAFYNGA SEFEIQQHYL SAVGQSENEV PYGNIIALNQ NAAILHYTAL EHQSPAKRLS FLIDAGASYF GYASDITRTY AFEKNRFDEL IAAMNKAQLE LIDMMRPGVR YPDLHLATHA KVAQMLLDFD LATGDAQGLV DQGITSAFFP HGLGHMLGLQ VHDVGGFSHD ERGTHIAAPE AHPFLRCTRI LAPNQVLTME PGLYIIDTLL NELKQDSRGL QINWQTVDEL RPFGGIRIED NVIVHQDRNE NMTRELGLAD
|
| |