Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Shewana3_0649 |
Symbol | |
ID | 4476859 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella sp. ANA-3 |
Kingdom | Bacteria |
Replicon accession | NC_008577 |
Strand | + |
Start bp | 766577 |
End bp | 768868 |
Gene Length | 2292 bp |
Protein Length | 763 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 639725184 |
Product | dipeptidyl-peptidase IV |
Protein accession | YP_868293 |
Protein GI | 117919101 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases |
TIGRFAM ID | [TIGR01168] Gram-positive signal peptide, YSIRK family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000821828 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.00018889 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACTAAAA ATGGGTTAGC TTCTTCCCTG CGCTCGGTGA CATTGGGCGC CTCATCTCTA CTGATAGCGA GTCAACTGGC CATGGTTTCT AGCGCGATAT TTACCCCCGC CTATGCGCTT GAGGGCGGCA CAACGCCGCT GACTATCGAG CGCATGAATG CGTCGCCTGC CTTGGCGGGC ACGAGTCCCC GAGGATTAAA ATTATCCCCC GATGGTCAAA GGGTGACCTA TCTTGCGGGA CGTAAAGACA ACCAAAATTT TTACGATCTC TGGCAGATGG ACGTGAAGAC GGGCGAATCT AGCCTGTTGC TAAACGCCGA TAAGCTGGCG AGCAATGAGT TATCCGATGA AGAAAAGGCC CGCCGCGAGC GCCAACGTAT TTATGGCGAA GGCATTATGG AATATTTCTG GGCCGACGAT AGCAAGGCGC TGTTGATCCC TGCCTCGGGT AATCTGTATT ACTTCTCCTT AGCGGATAAC AGCGTTAGCC AACTGCCGAT AGGCGAGGGC TTTGCCACCG ATGCTCGCCT ATCGCCCAAG GGGAATTTTG TGTCCTTCGT GCGGGACCAA AATCTGTATG TGCTCAATTT AGCGACTAAA AAGCTTGAGG CCATGACTAC AGACGGTGGC GGCGCGATTA AAAATGCCAT GGCGGAATTT GTTGCCCAGG AAGAAATGGA TCGTATGACG GGTTACTGGT GGGCGCCCGA TGAATCGGCC ATCGCCTTTA CCCGTATCGA TGAGTCGGGC GTTGAGTTGG CCACCCGCAA TGAAATCTAC GCCGATGGCA TTAAGCTTAC CGAGCAGCGT TATCCGGCTG CGGGTAAAAA CAATGTCGAG ATCCAACTGG GTGTTGTGAC CCTTAAGAAT AAAGCCATCA ATTGGGTCAC CTTAAGCGAT GATAAAAATA AAGATATTTA CTTGCCTCGC GTCGATTGGT TGCCGGATAG CAAGCACTTA TCCTTCCAGT GGCAGAGCCG CGATCAACAA AAATTAGACC TGCAACTGGT GGCTCTCGAT GCGCTGACCA AGCCGAAAAC CTTAGTCAAA GAGCGTAGCG ATGCTTGGGT GAATCTTAAC AATGACCTGC ACTTCTTAAA GCAACAATCG GCGTTTATCT GGGCCTCGGA GCGTGACGGC TTTAATCATC TGTATTTGTT TGACCTAAAA GGTAAGCTGA AGACGCAATT GACCAAGGGC AATTGGGCGG TCGATGAGCT GGAATATATT GATGAAACCG CAGGTTGGGT GTATTTCACT GGCCGCAAAG ATACCCCTAT CGAAAAACAC CTCTACCGTG TGCCGTTAGC GGGTGGCAAG GTTGAGCGCG TGAGCAGCGA GGCGGGGATG CACAATCCGG TATTTGCCGA TAATCAAAGC GTGTATCTAG ATTATTTCAA TAGCTTATCT CAGCCACCAC AGATCAGTTT GCACGGCGAT AAGGGCCAAC ATCTGGCGTG GGTAGAGCAG AACCAAGTCA AAGCGGGCCA TCCTCTGTAT GACTATGCTG GCCTTTGGCA ATTACCAGAG TTTAAAGAGC TTAAGGCGGA AGACGGACAA GTCCTGCAAA CGCGCCTCTT TAAACCCATG CCCTTCGATG CGAGCAAAAA GTATCCGGTA GTCGTACGTG TTTATGGCGG CCCGCATGCG CAATTGGTTA CCAATAGCTG GAGTGAGCAG GATTACTTTA CCCAATACTT AGTGCAACAA GGTTATGTGG TATTTCAACT GGATAACCGT GGCAGCGCCC ACAGAGGCAC CCAGTTCGAG CAAGTGATTT ACCGTCACTT GGGTGAAGCG GAGGTGAACG ACCAAAAGGT TGGCGTCGAT TATCTGCGCA GCTTGCCCTT TGTCGATAGC AACAACGTGG CGATTTATGG CCACAGCTAC GGCGGTTATA TGGCCTTGAT GAGCCTGTTT AAGGCGCCGG ATTACTTTAA GGCGGCGATT TCGGGCGCGC CCGTCACAGA TTGGCGTTTG TACGACACTC ACTACACCGA GCGTTATCTC GCCCATCCGG CGAATAATGA ACAAGGTTAC GAAGCCAGTA GCGTGTTCCC CTATGTGAAA AACTACCAGT CGGGGCTGTT GATGTACCAC GGCATGGCCG ACGATAACGT GTTGTTTGAA AACAGCACTC GCGTGTACAA GGCGTTGCAG GACGAAGGCA AACTGTTCCA GATGATTGAT TATCCTGGCT CTAAGCATTC GATGCGCGGT GAAAAAGTGC GTAATCACTT ATATCGCTCG CTGGCGGATT TCCTCGATAG ACAGCTTAAA AGCGGCGAAT AA
|
Protein sequence | MTKNGLASSL RSVTLGASSL LIASQLAMVS SAIFTPAYAL EGGTTPLTIE RMNASPALAG TSPRGLKLSP DGQRVTYLAG RKDNQNFYDL WQMDVKTGES SLLLNADKLA SNELSDEEKA RRERQRIYGE GIMEYFWADD SKALLIPASG NLYYFSLADN SVSQLPIGEG FATDARLSPK GNFVSFVRDQ NLYVLNLATK KLEAMTTDGG GAIKNAMAEF VAQEEMDRMT GYWWAPDESA IAFTRIDESG VELATRNEIY ADGIKLTEQR YPAAGKNNVE IQLGVVTLKN KAINWVTLSD DKNKDIYLPR VDWLPDSKHL SFQWQSRDQQ KLDLQLVALD ALTKPKTLVK ERSDAWVNLN NDLHFLKQQS AFIWASERDG FNHLYLFDLK GKLKTQLTKG NWAVDELEYI DETAGWVYFT GRKDTPIEKH LYRVPLAGGK VERVSSEAGM HNPVFADNQS VYLDYFNSLS QPPQISLHGD KGQHLAWVEQ NQVKAGHPLY DYAGLWQLPE FKELKAEDGQ VLQTRLFKPM PFDASKKYPV VVRVYGGPHA QLVTNSWSEQ DYFTQYLVQQ GYVVFQLDNR GSAHRGTQFE QVIYRHLGEA EVNDQKVGVD YLRSLPFVDS NNVAIYGHSY GGYMALMSLF KAPDYFKAAI SGAPVTDWRL YDTHYTERYL AHPANNEQGY EASSVFPYVK NYQSGLLMYH GMADDNVLFE NSTRVYKALQ DEGKLFQMID YPGSKHSMRG EKVRNHLYRS LADFLDRQLK SGE
|
| |