Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_0602 |
Symbol | |
ID | 4078640 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 642346 |
End bp | 643503 |
Gene Length | 1158 bp |
Protein Length | 385 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 638005899 |
Product | amidohydrolase 3 |
Protein accession | YP_612597 |
Protein GI | 99080443 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3454] Metal-dependent hydrolase involved in phosphonate metabolism |
TIGRFAM ID | [TIGR02318] phosphonate metabolism protein PhnM |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.188787 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.301393 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGAAG ATGGATACGT CACACTGCGT CTGGTGGGAG CAGATGTGCT GCGCGCAGAT GGGCTGGAGC GCAGCGGGGC GCTGACCCTC GCCGACGGAC TTCTGCAAGA CGCATCAGGC GCCAGAGAGG TGGATCTCAA CGGCTATCGC CTCCTTCCGG GGATTGTGGA CCTGCACGGA GACGGGTTTG AACGCCACGT CGCCCCCCGC CGAGGCGCCA TGAAGCAAAT GGGCGAAGGG ATCCGCGCCA CTGAGGCAGA GCTCGCTGCC AACGGCATTA CGACCGGAGT GCTGGCGCAA TTCCTGTCGT GGGAAGGCGG ACTGCGTGGG GCCGAGTTTG CACATCAGGT CTTTGAGGGC ATCCGCGAGG TGCGCAGCCA ATTGGTTACG GATCTGATCC CTCAGCTTCG ATTCGAGATC CACCTGCTTG ACCTCTATGA CACGCTTCCT GCGCAAATTG CTGATTGGGA GGTGCCCTAT GTGGTGTTCA ACGATCACCT CCCACACGAC AGGCTCGCGC AGGGGAAAAA GCCGCCGCGG TTGACGGGGC AGGCGCTCAA GGCTGGGCGA AATCCTGAAA AGCACTTTGA GATGCTGCTT GAGATGCACG CCCGCAGAGA CGAGGTTGGC CCCGCATTGG ACAGGCTGTG CAATCTTCTA CGCCAAAATG GCATCTGCTT TGGCAGTCAT GATGATCATA GCGCGCAGGA TCGGGAAATC TGGCGCGCGC GTGGCGCGCA TATCTCGGAG TTTCCGGAAA CCGCTGAGGC CGCAGAGGCC GCCCGGAGTG CGGGAGATCA CATCATTCTC GGTTCCCCGA ATGTGGTGCG CGGTGGCAGC CATAAGGGCA ATGCTTCGGC GGTGGAACTG ATCTCGATGG GACTCTGTGA TGCGCTGGCT TCGGATTATC ACTATCCCAG CCCGCGCCGT GCGGCGCTTA TGCTGGCCAA AACCGGCCTG CTGCCTTTGG AACAGGCCTG GGCGCTGGTC TCATCCGGCC CGGCAAAGAT CCTTGGTCTT CACGATCGTG GAGTCCTGAC GGATGGGAAA CGCGCAGATC TGGTGATCCT TGACGAAAAC AATCAGGTCG CGGCGACGCT TTCAGGCGGG CGCGTAAGCT ACATGAGCGG GGCTATTGCT GCGCGCTTCT TGAGCTGA
|
Protein sequence | MSEDGYVTLR LVGADVLRAD GLERSGALTL ADGLLQDASG AREVDLNGYR LLPGIVDLHG DGFERHVAPR RGAMKQMGEG IRATEAELAA NGITTGVLAQ FLSWEGGLRG AEFAHQVFEG IREVRSQLVT DLIPQLRFEI HLLDLYDTLP AQIADWEVPY VVFNDHLPHD RLAQGKKPPR LTGQALKAGR NPEKHFEMLL EMHARRDEVG PALDRLCNLL RQNGICFGSH DDHSAQDREI WRARGAHISE FPETAEAAEA ARSAGDHIIL GSPNVVRGGS HKGNASAVEL ISMGLCDALA SDYHYPSPRR AALMLAKTGL LPLEQAWALV SSGPAKILGL HDRGVLTDGK RADLVILDEN NQVAATLSGG RVSYMSGAIA ARFLS
|
| |