Gene TM1040_0719 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_0719 
Symbol 
ID4076996 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp776435 
End bp777598 
Gene Length1164 bp 
Protein Length387 aa 
Translation table11 
GC content60% 
IMG OID638006016 
Productpeptidase M20D, amidohydrolase 
Protein accessionYP_612714 
Protein GI99080560 
COG category[R] General function prediction only 
COG ID[COG1473] Metal-dependent amidase/aminoacylase/carboxypeptidase 
TIGRFAM ID[TIGR01891] amidohydrolase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.093275 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGTCA AGAACCGCTT TGCTGAGTTG CTGCCCGAAA TTACTGCGTG GCGACGTGAC 
CTGCACGAAA ACCCCGAAAT CCTGTTCGAA ACCCATCGCA CCAGCGCGCT GGTGGAGGAG
AAACTCAAGG CGTTCGGCTG CGACGAAGTG GTCACGGGCA TTGGCCGCAC CGGCGTTGTG
GGCGTCATCA AGGGTAAATC CTCCGCATCA GGCAAGGTCA TCGGTCTGCG GGCCGATATG
GACGCGCTGC CGATCCATGA AGAAACCGGG CTGGAGTATG CCTCCAAGAC CGCAAACGCC
ATGCACGCCT GTGGTCATGA CGGTCATACC GCCATGCTTT TGGGCGCGGC GAAATATCTC
TCCGAGACGC GGAACTTCGA CGGCACCGTT GTGGTGATCT TTCAGCCTGC CGAAGAAGGC
GGCGGCGGCG GCAAGGAAAT GTGCGATGAT GGCATGATGG AGCGCTGGGG CATCCAGGAA
GTCTATGGCA TGCACAATTG GCCGGGTCGC CCGGTTGGAA GCTTTGCAAT CCGTTCGGGT
GCCTTCTTTG CGGCGACCGA TCAGTTCGAC ATCACCTTTA CCGGCAAAGG CGGCCATGCC
GCTAAGCCGC AGGAAACCAT CGATTCGACC GTGATGGCAT CGCAGGCGGT GCTTGCCCTG
CAAACCATCG CTGCCCGCAA CGCCGATCCC GTGCATCAGA TCGTGGTCTC TGTGACCTCT
TTTGAGACCT CCTCCAAGGC GTTCAACGTG ATTCCTGAGC GCGTTCAGAT CAAAGGCACC
GTGCGCACCA TGGCGCCCGA GATGCGGGAT CTTGCTGAAA AACGTATCAA GGAAATCTGC
GCGGGCATCG CAGCGACCTT TGGCGGTGAA GCCGATGTGA CTTACCACCG TGGCTATCCG
GTGATGGTGA ACCATGACGA GCAGACCGAG TTTGCCGCCA AAGTGGCGCG TGACATTTCC
GGGCAGTGCG ATGAGGCGCC GCTGGTGATG GGGGGCGAAG ACTTTGCCTT CATGCTCGAA
GAGCGTCCCG GTGCCTATAT TCTCGTCGGC AATGGGGACA CCGCCGCCGT GCATCACCCC
AAGTATAACT TCACCGATGA TGCGATTCCC GCAGGCTGCA GCTGGTGGGC GGAGATCGTC
GAGCAGCGCA TGCCCGCAGC CTGA
 
Protein sequence
MPVKNRFAEL LPEITAWRRD LHENPEILFE THRTSALVEE KLKAFGCDEV VTGIGRTGVV 
GVIKGKSSAS GKVIGLRADM DALPIHEETG LEYASKTANA MHACGHDGHT AMLLGAAKYL
SETRNFDGTV VVIFQPAEEG GGGGKEMCDD GMMERWGIQE VYGMHNWPGR PVGSFAIRSG
AFFAATDQFD ITFTGKGGHA AKPQETIDST VMASQAVLAL QTIAARNADP VHQIVVSVTS
FETSSKAFNV IPERVQIKGT VRTMAPEMRD LAEKRIKEIC AGIAATFGGE ADVTYHRGYP
VMVNHDEQTE FAAKVARDIS GQCDEAPLVM GGEDFAFMLE ERPGAYILVG NGDTAAVHHP
KYNFTDDAIP AGCSWWAEIV EQRMPAA