Gene TM1040_0465 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_0465 
Symbol 
ID4078347 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp483198 
End bp485213 
Gene Length2016 bp 
Protein Length671 aa 
Translation table11 
GC content60% 
IMG OID638005761 
Productpeptidyl-dipeptidase Dcp 
Protein accessionYP_612460 
Protein GI99080306 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0339] Zn-dependent oligopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAAATC CGCTGCTTCG CGACTGGCAA ACGCCCTTTG GGATTGCCCC GTTCGATCAG 
ATTTCCGACG AAGACTTTGC GCCTGCGCTT GATGAGGCGC TAAAGGCGCA TACTGCGCAG
ATCGATGCGA TTGCCACCAA CCCCGCGCCG CCCACATTCG CCAATGTGAT CGAGGCTCTC
GAGACGCCAT GCCAGCCACT TGAACAGGTG CTGTCGGTGT TTTTCTCGGT GGCCGGGGCG
GACAGCAACC CGGCGCGCGA GGCGCTCCAG CGCGCGTTCT CTCCGAATCT TGCGGCGCAT
TTCTCGGCCA TTACTGCCAA TAAGGCACTT TATGCACGGG TGCAGGCGGT GTGGGACCAA
CGCGACACTC TGGACCTGAC CGACGAACAG GCCCGTGTCC TGATGCTCAC CCATCGTGGG
TTTGTGCGCG GCGGCGCGGC GCTCAATGGC GCGGAAGATA CCCGCATGCA GGAGATCAAA
TCCCGCCTTG CGACGCTTGG CACCAATTTC ACCCAGAACC TGCTTGCGGA TGAGCGGGAC
TGGTTCATGG AGCTCTCCGA GGACGATCTT GAGGGGCTTC CTGATTTTGT TGTGAAGGCC
GCACGGACCG CGGGGGCCGA AAAAGGCGTT TCCGGGCCGG TCATTACGCT TGCGCGCTCT
ATCGTGACGC CTTTCCTGCA GTTCTCGCCG CGCAGGGACC TGCGTGAAAA GGCCTTTCGG
GCCTGGGCTG CACGAGGGGC AAATGGCGGC GCACATGACA ACCGGGCGAT TGCGGCAGAG
ATCCTGTCGC TGCGTGAAGA ACGGGCCAAG CTCTTGGGCT ATAGGAATTT TGCGGCCTAC
AAGCTGGAAA CCGAGATGGC CAAAACGCCG GAGGCGGTGC GTGAGCTGTT GATGGATGTC
TGGGGGCCAG CCAAGGCGCA GGCGGACAAG GACGCCGAGG TGTTGACGCG GATGATGCAT
GAGGACGGCA TCAATGGCGA TCTGGCGCCT TGGGATTGGC ATTACTACGC AGAAAAACGT
CGCAAGATAG AACATGATCT CGACGAAGCC GAACTCAAAC CATACCTGCA GCTCGACCGG
ATGATTGAGG CGGCTTTTGC CTGCGCAAAT CGTCTGTTTG GCCTGGAATT CGCACCTCTT
GATGTGCCGC TTTATCACAA GGACTGCCGC GCCTGGGAGG TTACGCGGGA AGGCAAGCAT
CTGGCGGTGT TTATTGGGGA CTATTTCGCG CGGGGATCAA AGCGGTCCGG CGCGTGGTGT
TCGGCTATGC GTGCGCAGGC GAAATTCCCC GAGGAACAGA CGCCAATCGT GGTCAATGTC
TGCAACTTTG CCAAAGGCGA CCCGGCGTTG TTGTCCTATG ACGATGCGCG CACGCTGTTC
CACGAGTTTG GCCATGCGCT GCACCAGATG CTGTCGAATG TAACCTATGA GAGTGTCTCC
GGGACCTCCG TGGCGCGCGA TTTTGTGGAA CTCCCGAGCC AGCTCTATGA GCATTGGCTG
GAGGTGCCCG AGGTACTGCA GGAGTTTGCC ACCCATGCAG AGACCGGCGC GCCGATGCCG
AAGGAGATGC TCGACAAGGT GCTTGGGGCT GCGACCTTCA ACATGGGGTT CCAGACCGTC
GAATATGTGG CGTCCGCATT GGTGGATCTG GAATTTCATG ATGGCAGCGC ACCCGAGGAT
CCGATGGTCA AACAGGCCGA GGTTCTCGCA CAGATCGGCA TGCCGGCAGC GATTGTCATG
CGCCATGCTA CGCCGCAGTT TGCGCATGTT TTCTCAGGCG ACGGGTACTC CTCGGGCTAC
TACAGCTATA TGTGGTCAGA GGTAATGGAC GCCGATGCTT TTGAATCCTT TGAGGAAGCC
GGCGGCGCCT TTGATCCTGC ACGCGCTGCG GCGCTTGAGG CACACATTCT GTCCAAAGGG
GGCTCTGAGG ATGCCGCTGA GCTGTATACG GCGTTCCGGG GGCGACTTCC GGGGGTGGAC
GCGCTTTTGA AAGGGCGTGG GCTGGCCGCC AAGTGA
 
Protein sequence
MTNPLLRDWQ TPFGIAPFDQ ISDEDFAPAL DEALKAHTAQ IDAIATNPAP PTFANVIEAL 
ETPCQPLEQV LSVFFSVAGA DSNPAREALQ RAFSPNLAAH FSAITANKAL YARVQAVWDQ
RDTLDLTDEQ ARVLMLTHRG FVRGGAALNG AEDTRMQEIK SRLATLGTNF TQNLLADERD
WFMELSEDDL EGLPDFVVKA ARTAGAEKGV SGPVITLARS IVTPFLQFSP RRDLREKAFR
AWAARGANGG AHDNRAIAAE ILSLREERAK LLGYRNFAAY KLETEMAKTP EAVRELLMDV
WGPAKAQADK DAEVLTRMMH EDGINGDLAP WDWHYYAEKR RKIEHDLDEA ELKPYLQLDR
MIEAAFACAN RLFGLEFAPL DVPLYHKDCR AWEVTREGKH LAVFIGDYFA RGSKRSGAWC
SAMRAQAKFP EEQTPIVVNV CNFAKGDPAL LSYDDARTLF HEFGHALHQM LSNVTYESVS
GTSVARDFVE LPSQLYEHWL EVPEVLQEFA THAETGAPMP KEMLDKVLGA ATFNMGFQTV
EYVASALVDL EFHDGSAPED PMVKQAEVLA QIGMPAAIVM RHATPQFAHV FSGDGYSSGY
YSYMWSEVMD ADAFESFEEA GGAFDPARAA ALEAHILSKG GSEDAAELYT AFRGRLPGVD
ALLKGRGLAA K