Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_0465 |
Symbol | |
ID | 4078347 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 483198 |
End bp | 485213 |
Gene Length | 2016 bp |
Protein Length | 671 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 638005761 |
Product | peptidyl-dipeptidase Dcp |
Protein accession | YP_612460 |
Protein GI | 99080306 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0339] Zn-dependent oligopeptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAAATC CGCTGCTTCG CGACTGGCAA ACGCCCTTTG GGATTGCCCC GTTCGATCAG ATTTCCGACG AAGACTTTGC GCCTGCGCTT GATGAGGCGC TAAAGGCGCA TACTGCGCAG ATCGATGCGA TTGCCACCAA CCCCGCGCCG CCCACATTCG CCAATGTGAT CGAGGCTCTC GAGACGCCAT GCCAGCCACT TGAACAGGTG CTGTCGGTGT TTTTCTCGGT GGCCGGGGCG GACAGCAACC CGGCGCGCGA GGCGCTCCAG CGCGCGTTCT CTCCGAATCT TGCGGCGCAT TTCTCGGCCA TTACTGCCAA TAAGGCACTT TATGCACGGG TGCAGGCGGT GTGGGACCAA CGCGACACTC TGGACCTGAC CGACGAACAG GCCCGTGTCC TGATGCTCAC CCATCGTGGG TTTGTGCGCG GCGGCGCGGC GCTCAATGGC GCGGAAGATA CCCGCATGCA GGAGATCAAA TCCCGCCTTG CGACGCTTGG CACCAATTTC ACCCAGAACC TGCTTGCGGA TGAGCGGGAC TGGTTCATGG AGCTCTCCGA GGACGATCTT GAGGGGCTTC CTGATTTTGT TGTGAAGGCC GCACGGACCG CGGGGGCCGA AAAAGGCGTT TCCGGGCCGG TCATTACGCT TGCGCGCTCT ATCGTGACGC CTTTCCTGCA GTTCTCGCCG CGCAGGGACC TGCGTGAAAA GGCCTTTCGG GCCTGGGCTG CACGAGGGGC AAATGGCGGC GCACATGACA ACCGGGCGAT TGCGGCAGAG ATCCTGTCGC TGCGTGAAGA ACGGGCCAAG CTCTTGGGCT ATAGGAATTT TGCGGCCTAC AAGCTGGAAA CCGAGATGGC CAAAACGCCG GAGGCGGTGC GTGAGCTGTT GATGGATGTC TGGGGGCCAG CCAAGGCGCA GGCGGACAAG GACGCCGAGG TGTTGACGCG GATGATGCAT GAGGACGGCA TCAATGGCGA TCTGGCGCCT TGGGATTGGC ATTACTACGC AGAAAAACGT CGCAAGATAG AACATGATCT CGACGAAGCC GAACTCAAAC CATACCTGCA GCTCGACCGG ATGATTGAGG CGGCTTTTGC CTGCGCAAAT CGTCTGTTTG GCCTGGAATT CGCACCTCTT GATGTGCCGC TTTATCACAA GGACTGCCGC GCCTGGGAGG TTACGCGGGA AGGCAAGCAT CTGGCGGTGT TTATTGGGGA CTATTTCGCG CGGGGATCAA AGCGGTCCGG CGCGTGGTGT TCGGCTATGC GTGCGCAGGC GAAATTCCCC GAGGAACAGA CGCCAATCGT GGTCAATGTC TGCAACTTTG CCAAAGGCGA CCCGGCGTTG TTGTCCTATG ACGATGCGCG CACGCTGTTC CACGAGTTTG GCCATGCGCT GCACCAGATG CTGTCGAATG TAACCTATGA GAGTGTCTCC GGGACCTCCG TGGCGCGCGA TTTTGTGGAA CTCCCGAGCC AGCTCTATGA GCATTGGCTG GAGGTGCCCG AGGTACTGCA GGAGTTTGCC ACCCATGCAG AGACCGGCGC GCCGATGCCG AAGGAGATGC TCGACAAGGT GCTTGGGGCT GCGACCTTCA ACATGGGGTT CCAGACCGTC GAATATGTGG CGTCCGCATT GGTGGATCTG GAATTTCATG ATGGCAGCGC ACCCGAGGAT CCGATGGTCA AACAGGCCGA GGTTCTCGCA CAGATCGGCA TGCCGGCAGC GATTGTCATG CGCCATGCTA CGCCGCAGTT TGCGCATGTT TTCTCAGGCG ACGGGTACTC CTCGGGCTAC TACAGCTATA TGTGGTCAGA GGTAATGGAC GCCGATGCTT TTGAATCCTT TGAGGAAGCC GGCGGCGCCT TTGATCCTGC ACGCGCTGCG GCGCTTGAGG CACACATTCT GTCCAAAGGG GGCTCTGAGG ATGCCGCTGA GCTGTATACG GCGTTCCGGG GGCGACTTCC GGGGGTGGAC GCGCTTTTGA AAGGGCGTGG GCTGGCCGCC AAGTGA
|
Protein sequence | MTNPLLRDWQ TPFGIAPFDQ ISDEDFAPAL DEALKAHTAQ IDAIATNPAP PTFANVIEAL ETPCQPLEQV LSVFFSVAGA DSNPAREALQ RAFSPNLAAH FSAITANKAL YARVQAVWDQ RDTLDLTDEQ ARVLMLTHRG FVRGGAALNG AEDTRMQEIK SRLATLGTNF TQNLLADERD WFMELSEDDL EGLPDFVVKA ARTAGAEKGV SGPVITLARS IVTPFLQFSP RRDLREKAFR AWAARGANGG AHDNRAIAAE ILSLREERAK LLGYRNFAAY KLETEMAKTP EAVRELLMDV WGPAKAQADK DAEVLTRMMH EDGINGDLAP WDWHYYAEKR RKIEHDLDEA ELKPYLQLDR MIEAAFACAN RLFGLEFAPL DVPLYHKDCR AWEVTREGKH LAVFIGDYFA RGSKRSGAWC SAMRAQAKFP EEQTPIVVNV CNFAKGDPAL LSYDDARTLF HEFGHALHQM LSNVTYESVS GTSVARDFVE LPSQLYEHWL EVPEVLQEFA THAETGAPMP KEMLDKVLGA ATFNMGFQTV EYVASALVDL EFHDGSAPED PMVKQAEVLA QIGMPAAIVM RHATPQFAHV FSGDGYSSGY YSYMWSEVMD ADAFESFEEA GGAFDPARAA ALEAHILSKG GSEDAAELYT AFRGRLPGVD ALLKGRGLAA K
|
| |