Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SO_3990 |
Symbol | |
ID | 1171621 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella oneidensis MR-1 |
Kingdom | Bacteria |
Replicon accession | NC_004347 |
Strand | - |
Start bp | 4127030 |
End bp | 4129321 |
Gene Length | 2292 bp |
Protein Length | 763 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 637345741 |
Product | dipeptidyl peptidase IV |
Protein accession | NP_719520 |
Protein GI | 24375477 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTAAAA ATGGGTTAGC TTCCTCCCTG CGCGCCGTGA AGTTGGGCGC TTCATCCCTA TTGATTGCGA GTCAACTGGC CATGATTTCT ACCCTTACCA CAGCTTCTGC CTATGCGCTT GAGGGCGGTA TGACGCCACT CACTATCGAG CGGATGAATG CTTCTCCCGC CTTGGCGGGC ACCAGTCCGC GCGGATTAAA GCTATCCCCC GATGGCCAAA AGGTGACCTA TCTAGCGGGG CGTAAAGATA ACCAAAACTT TTACGATCTT TGGCAAATGG ATGTTAAAAC AGGAAAATCT AGCCTGTTGC TGAATGCCGA TAAACTGGCG AGTAATGAAC TGTCCGATGA AGAAAAGGCC CGCCGCGAAC GCCAACGGAT TTATGGCGAA GGCATTATGG AATATTTCTG GGCCGACGAT AGCAAAGCGC TGTTGATCCC CGCTGCGGGC AATTTGTATT ACTTCTCCTT GGCGGATAAC AGCGTCAGCC TATTGCCGAT TGGTGAAGGT TTTGCCACTG ATGCCCGCCT ATCGCCCAAG GGTAACTTTG TATCCTTCGT TCGTGATCAA AATCTGTATG TACTTAATCT TGCAACTAAA AAGCTTGAGG CCATGACCAC CGATGGTGGC GGCGTGATTA AAAATGCCAT GGCAGAATTT GTGGCGCAGG AAGAAATGGA TCGCATGACA GGTTACTGGT GGGCGCCCGA TGAGTCGGCC ATTGCCTTTA CTCGTATTGA TGAATCCGCT GTTGAGTTAG TCACCCGCAA TGAAATCTAT GCCGATGGCA TTAAACTCAC CGAGCAGCGT TATCCGGCGG CAGGTAAAAA CAATGTCGAG ATCCAACTCG GTGTAGTAAC GCTGAAAAAT AAAGCCATCG ATTGGGTAAC CTTAAGCGAT GATAAAAATA AGGATATTTA CCTGCCTCGC GTCGATTGGT TGCCGGATAG CAAGCATTTA TCCTTTCAAT GGCAGAGCCG CGATCAACAA AAGTTAGACC TGCAACTGGT TGCCTTAGAT TCGCTGACCA AGCCGAAAAC CTTAGTCAAA GAGCGCAGCG ATGCTTGGGT GAATCTTAAC AACGACTTGC ACTTCTTAAA GCAACAATCG GCTTTTATTT GGGCCTCGGA GCGTGATGGC TTTAATCATC TCTATTTGTT TGACCTAAAA GGTAAGCTGA AGACGCAACT GACCAAGGGC AATTGGGCTG TCGATGAGCT GGAATTTATC GATGAAACCG CAGGTTGGGT GTATTTCACT GGCCGCAAAG ATACCCCAAT CGAAAAACAC CTTTACCGTG TGCCGTTAGC GGGCGGAAAT ATTGAGCGCG TGAGCAGTGA GGCGGGGATG CACGATCCGG TATTTGCCGA TAATCAAAGC GTGTATCTCG ATTATTTCAA TAGCTTATCT CAACCGCCGC AGGTCAGTTT GCATGGCGAT AAGGGCCAGC ATTTGGCCTG GGTTGAGCAG AACCAAGTCA AAGCGGGTCA TCCTCTGTAT GACTATGCTG GACTCTGGCA ACTACCTGAG TTTAAAGAGC TTAAAGCCGA AGATGGCCAA ATCTTACAAA CGCGTTTATT TAAACCCGTG CCCTTCGATG CGGGGAAAAA GTACCCCGCG GTCGTGCGGG TTTATGGCGG CCCACATGCG CAGTTGGTGA CCAATAGCTG GAGTGAGCAA GATTACTTTA CCCAATACTT AGTTCAGCAA GGTTATGTGG TATTTCAATT GGATAACCGC GGCAGTGCCC ACAGGGGTAC CAAGTTTGAG CAAGTGATTT ACCGTCATTT GGGCGAAGCG GAAGTCAATG ATCAAAAGGT TGGGGTGGAT TATCTGCGCA GCTTGCCCTT TGTCGATAGC AATAATGTGG CGATTTATGG CCACAGCTAC GGCGGTTATA TGGCTTTGAT GAGTCTATTT AAGGCGCCGG ATTACTTTAA GGCGGCGATT TCGGGTGCAC CTGTCACCGA TTGGCGTTTG TATGACACCC ATTATACCGA GCGTTATTTA GCCCATCCCG CAAGCAATGA GCAAGGTTAT GAAGCCAGTA GCCTGTTCCC CTATGTGAAA AACTACCAGT CGGGGCTGTT GATGTACCAC GGCATGGCCG ACGATAACGT GTTGTTTGAA AACAGTACTC GCGTCTATAA GGCGTTGCAG GACGAAGGCA AGTTGTTTCA AATGATTGAT TATCCAGGTT CTAAGCATTC AATGCGTGGC GAAAAGGTGC GTAACCATTT ATATCGCTCG CTGGCGGATT TCCTCGATAG ACAGCTTAAA AACGGCAAAT AA
|
Protein sequence | MTKNGLASSL RAVKLGASSL LIASQLAMIS TLTTASAYAL EGGMTPLTIE RMNASPALAG TSPRGLKLSP DGQKVTYLAG RKDNQNFYDL WQMDVKTGKS SLLLNADKLA SNELSDEEKA RRERQRIYGE GIMEYFWADD SKALLIPAAG NLYYFSLADN SVSLLPIGEG FATDARLSPK GNFVSFVRDQ NLYVLNLATK KLEAMTTDGG GVIKNAMAEF VAQEEMDRMT GYWWAPDESA IAFTRIDESA VELVTRNEIY ADGIKLTEQR YPAAGKNNVE IQLGVVTLKN KAIDWVTLSD DKNKDIYLPR VDWLPDSKHL SFQWQSRDQQ KLDLQLVALD SLTKPKTLVK ERSDAWVNLN NDLHFLKQQS AFIWASERDG FNHLYLFDLK GKLKTQLTKG NWAVDELEFI DETAGWVYFT GRKDTPIEKH LYRVPLAGGN IERVSSEAGM HDPVFADNQS VYLDYFNSLS QPPQVSLHGD KGQHLAWVEQ NQVKAGHPLY DYAGLWQLPE FKELKAEDGQ ILQTRLFKPV PFDAGKKYPA VVRVYGGPHA QLVTNSWSEQ DYFTQYLVQQ GYVVFQLDNR GSAHRGTKFE QVIYRHLGEA EVNDQKVGVD YLRSLPFVDS NNVAIYGHSY GGYMALMSLF KAPDYFKAAI SGAPVTDWRL YDTHYTERYL AHPASNEQGY EASSLFPYVK NYQSGLLMYH GMADDNVLFE NSTRVYKALQ DEGKLFQMID YPGSKHSMRG EKVRNHLYRS LADFLDRQLK NGK
|
| |