Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Shewmr4_0019 |
Symbol | |
ID | 4250705 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella sp. MR-4 |
Kingdom | Bacteria |
Replicon accession | NC_008321 |
Strand | + |
Start bp | 23767 |
End bp | 25086 |
Gene Length | 1320 bp |
Protein Length | 439 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 638116558 |
Product | proline dipeptidase |
Protein accession | YP_732158 |
Protein GI | 113968365 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0006] Xaa-Pro aminopeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0806227 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.000125098 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGATCACT TGGCTCATCA CTATCATGCC CACATCGCCG AGCTTAATCG TCGAGTCGCT GAAATTGTAT CGCGTGAAGC ATTATCAGGT TTAGTGATCC ACTCGGGTCA GCCACATCGA ATGTTTTTGG ACGACATTAA TTATCCTTTT AAAGCCAATC CCCACTTCAA GGCTTGGTTA CCTGTGTTGG ATAACCCTAA CTGCTGGTTA GTAGTAAACG GTCGCGATAA GCCACAACTG ATTTTTTATC ACCCTGTGGA CTTTTGGCAT AAGGTGTCTG ACGTGCCGGA GATGTTCTGG ACCGAGCATT TTGAGATCAA GTTACTCACT AAGGCCGATA AGGTTGCCGA GCTATTGCCT AGCGATATCA CCAACTGGGC TTATTTAGGT GAGCATTTAG ATGTGGCTGA GGTACTGGGT TTCACCAGTC GCAATCCTGA CTCTGTGATG AGCTATCTGC ATTTCCACCG TACCACTAAA ACCGAATATG AACTCGAATG TATGCGCCGT GCGAACCAGA TCGCGGTGCA AGGGCATTTA GCGGCTAAAA ATGCCTTCTA TAACGGTGCC AGCGAGTTTG AAATTCAGCA GCAGTATTTA TCTGCCGTGG GACAGGGCGA GAACGAAGTG CCCTACGGTA ATATCATTGC CTTAAACCAA AATGCGGCGA TTTTGCATTA CACCGCGCTC GAGCATCAAA ATCCGGCGCG GCGCTTATCT TTTTTAATCG ATGCGGGCGC CAGTTATTTT GGCTACGCGT CGGATATCAC GCGTACCTAT GCGTTTGAGA AGAATCGTTT CGATGAATTG ATCACTGCCA TGAACAAGGC GCAGCTCGAA CTGATCGATA TGATGCGCCC AGGTGTGCGT TATCCCGATT TACACTTGGC TACCCATGGC AAAGTCGCGC AAATGCTGTT GGATTTTGAG TTAGCCACGG GCGATGCCCA AGGCTTAGTC GACCAAGGCA TAACCAGCGC CTTCTTCCCC CACGGACTCG GCCATATGTT AGGCTTGCAA GTACATGATG TGGGAGGTTT TGCCTTCGAT GAGCGTGGTA CCCATATTCC GGCCCCTGAG GCCCATCCGT TCCTGCGTTG CACCCGCATT TTAGCGCCAA ACCAAGTGCT AACGATGGAG CCAGGATTAT ACATTATCGA TACTTTACTC AATGAGCTAA AACAAGATAG CCGTGACCAG CAGATCAATT GGCGCACCGT TGATGAGTTG CGACCTTTCG GTGGTATCCG TATCGAGGAC AATGTGATTG TGCATCAGGA TCGAAACGAA AATATGACCC GCGAGCTGGG CTTAGCGTGA
|
Protein sequence | MDHLAHHYHA HIAELNRRVA EIVSREALSG LVIHSGQPHR MFLDDINYPF KANPHFKAWL PVLDNPNCWL VVNGRDKPQL IFYHPVDFWH KVSDVPEMFW TEHFEIKLLT KADKVAELLP SDITNWAYLG EHLDVAEVLG FTSRNPDSVM SYLHFHRTTK TEYELECMRR ANQIAVQGHL AAKNAFYNGA SEFEIQQQYL SAVGQGENEV PYGNIIALNQ NAAILHYTAL EHQNPARRLS FLIDAGASYF GYASDITRTY AFEKNRFDEL ITAMNKAQLE LIDMMRPGVR YPDLHLATHG KVAQMLLDFE LATGDAQGLV DQGITSAFFP HGLGHMLGLQ VHDVGGFAFD ERGTHIPAPE AHPFLRCTRI LAPNQVLTME PGLYIIDTLL NELKQDSRDQ QINWRTVDEL RPFGGIRIED NVIVHQDRNE NMTRELGLA
|
| |