Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Shewmr4_3601 |
Symbol | |
ID | 4254165 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella sp. MR-4 |
Kingdom | Bacteria |
Replicon accession | NC_008321 |
Strand | + |
Start bp | 4306580 |
End bp | 4308565 |
Gene Length | 1986 bp |
Protein Length | 661 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 638120244 |
Product | peptidase S9 prolyl oligopeptidase |
Protein accession | YP_735722 |
Protein GI | 113971929 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAACTT TGCCATTTGC CGCGCTGGCT GTAATTTGCT TGCCTATTAT GCCTTTCAGC ACATTAGCGG CCGAAACCTC GGCGACAAAT GCCCTGTCAC AATCCCAATT ATTCAGTCGC GGTAACGAGT ACTCTAACGT CAAGATCTCG CCTACCGGCA AGTACTTAAG TGCAATCACT AGTGTCGAAG GCAAGAATGT CCTCTTAGTA CTCGATGCCC AAACCAAAAA ACTGCTTAAC GCAATTCGCT TCCCGAGCAA CGCGCAGGTA GGCACCTATG AATGGGCAAA TAGTGAGCGT ATTGTACTTG CAAAAGAATA CCTTAAGGGC TGGAGCGATG TGCCCCAATA CTACGGCGAA TTAATGGCGG TCAATGCCGA TGGTTCTCGC CCTAAATATC TATTTGGATA TAACAGCGGC GAGCAGCAAA CCGGCTCGAA TATCAAGAAA AATACTCCTA TAAGTGCTAC CGCCTTTATT CTCGATCCTC TGCCTGATGA CGAGCGTTAT ATGCTGGTCA ATGCGATCCC ATGGGGTGGT GCCCCAGACT TGAGTGAAAC ACTTCAGGAT GTTTACCGCG TAGACCTTTT TAGTGGGGTT CGTAAACGCA TCACAGGCTC CCCCATTGGC CGAGCACGCT TTATGACAGA TCATGAAGGT GAAGTCCGCT TTGTGGCTGG GGAAGATGGC AAAAACATCA CTAAAGTCTT TTACCGCAAA GATGGCGAAT GGTTAAACAC CGATAAACTC AACTTAGGCT TAAGTGATTT TACACCTATC TCTTTCGCCG ATAATAAAAA TAGTATTTAC GCCGCAGGCC GAGTGGGCAC TGAAACCTTA GGTGTCTATC GCATCAATCT CGAAACAGGG GAGAAGGCCG AGATTATTCA AGATGAAGTG GTCGATCCAA GCAACTTCTG GATAAATGGC ACTAACAAAC AGCTCTATGC CGTTGAGTTT GAAAATGGCT ATCCAAGCTA TGCCTTTGTC GATAACAACG ATAACCATGC CAAACTGCTT AAGGATTTAC TCGCGGCCCT GCCGGGGCAT CAAGTACAAA TCGTCAGCGA AACCCGTAAT GGCGAACAAT TGGTGGTGAT TGCATTTAAC GATCGCAATC CCGGTGATTA CTATTTGTTT GATACGAAAA AGCTCAAGCT AGAGTATCTG GCCGCCGCCC GTAAGTGGCT CGACCCAGAA AAAATGGCTG AGGTTAAACC TATTAGTTTC ACTAACCGTG ATGGCCAGAA AATCCATGGC TACTTAACCT TACCCAATGG AAAAGAAGCC AAAAATTTAC CTTTGGTCGT CAATCCCCAT GGTGGCCCCC ATGGCATTCG TGACTGGTGG GGGTTTGACC CACAAAACCA ACTACTTGCT CAAAATGGTA TGGCGGTTTT ACAGGTTAAC TTCCGTGGCT CAGGTGGTTA TGGCGAACGT TTCGAGCAAG CTGGCTATCA AAAATGGGGC TCGGATATTC AGCACGATAT TATCGATGCC ACTCAATATG TGATTGACCA AGGCCTTGCC GATAAGGAAC GGGTGTGTAT CGCAGGCGGT AGCTTTGGCG GCTATAGCGC CTTGCAAAGT GCGGTATTAG CACCCGATAT GTTTAAATGC GCGGTTGGTT TTGCCGGTGT GTATGATCTG GAATTGATGT TTGATGAAGG TGACGTCGCC AGAACACGTT CAGGCACAAG CTATCTTAAG GACGTACTTG GCCAAGACAA AGCCACCCTA AAAGCCATGT CTCCCTCTGA GAACGTAGCA AAATTAAAAG CGAACCTCTT ACTGGTGCAC GGTGGTGACG ATGAGCGAGC ACCGATTGAG CAACTCGAAT CACTCGAAAA AGCCCTCAAG GCCCATAATT ATCCCTATCA AAAACTGGTG ATGGATAACG AAGGCCATGG TTTTTATGAT GATAGCCATA GAGCCAAGTA TTACGATCAG ATGCTAAGCT TCTTAAAAAC CAACCTGAAA CTTTAG
|
Protein sequence | MKTLPFAALA VICLPIMPFS TLAAETSATN ALSQSQLFSR GNEYSNVKIS PTGKYLSAIT SVEGKNVLLV LDAQTKKLLN AIRFPSNAQV GTYEWANSER IVLAKEYLKG WSDVPQYYGE LMAVNADGSR PKYLFGYNSG EQQTGSNIKK NTPISATAFI LDPLPDDERY MLVNAIPWGG APDLSETLQD VYRVDLFSGV RKRITGSPIG RARFMTDHEG EVRFVAGEDG KNITKVFYRK DGEWLNTDKL NLGLSDFTPI SFADNKNSIY AAGRVGTETL GVYRINLETG EKAEIIQDEV VDPSNFWING TNKQLYAVEF ENGYPSYAFV DNNDNHAKLL KDLLAALPGH QVQIVSETRN GEQLVVIAFN DRNPGDYYLF DTKKLKLEYL AAARKWLDPE KMAEVKPISF TNRDGQKIHG YLTLPNGKEA KNLPLVVNPH GGPHGIRDWW GFDPQNQLLA QNGMAVLQVN FRGSGGYGER FEQAGYQKWG SDIQHDIIDA TQYVIDQGLA DKERVCIAGG SFGGYSALQS AVLAPDMFKC AVGFAGVYDL ELMFDEGDVA RTRSGTSYLK DVLGQDKATL KAMSPSENVA KLKANLLLVH GGDDERAPIE QLESLEKALK AHNYPYQKLV MDNEGHGFYD DSHRAKYYDQ MLSFLKTNLK L
|
| |