Gene Information |
Locus tag | TM1040_0854 |
Symbol | |
ID | 4076029 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 906573 |
End bp | 907493 |
Gene Length | 921 bp |
Protein Length | 306 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 638006152 |
Product | haloacid dehalogenase-like hydrolase |
Protein accession | YP_612849 |
Protein GI | 99080695 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0647] Predicted sugar phosphatases of the HAD superfamily |
TIGRFAM ID | [TIGR01459] HAD-superfamily class IIA hydrolase, TIGR01459; [TIGR01460] Haloacid Dehalogenase Superfamily Class (subfamily) IIA |
|
|
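The coordinate and length fields in this record are related by simple arithmetic, which the Python sketch below checks. The function names are illustrative only and do not belong to any database API. The sketch assumes that Start bp and End bp are 1-based inclusive coordinates and that the terminal stop codon is not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(906573, 907493)
print(length_bp, protein_length(length_bp))  # 921 306
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields listed for TM1040_0854.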
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.770562 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAGGCC AAAGTACAGA TATCGGCACT GACTGGGCGT TTCAGCGCTA TGAGAGCGTG CGTGCAACCC TGCCCGAGGC CAGTTTTGCG GCGGCCTCAC GGCGGGGAGG AGATCTTGGC GACACCGTCG GAGATTTTGA CGCCTATATC CTCGATGCCT TTGGGGTTTT GAACCGGGGC GAGACCGCAA TTGCGGGCGC GGTGGAGCGC ATGGCAGCAC TCAGGGCGCT TGGTAAAAGG CTGGTGGTGC TGACCAATGC GGCAAGCTAC ACGCGCGCAG AGGTGCTGGC GAAATATCAC CGGCTTGGCT TTGACTTCGA CGCGTCAGAA GTGGTCTCAA GCCGCGATGT GGCCTTTGCC GGTCTGCCCG CACTCCCGGC CGGCGCATTT TGGGCCGCCG CTGCCGCAGC AGGTGATGAT TTCAGTGATG CCCCCAGCGG CGCTGAAATC GCGCATCTGG CAGAGCGGCC GGAGCTCTTG CAAAGCGCGG GTGGCTTTCT GCTGCTGTCC TCTGCACGCT GGAGCGCGGC CGAAACAGAC GCGCTCACCG AGGCATTGTT GGCGTCTCCG CGTCCTCTGG TGGTCGCAAA CCCCGATCTC GTCGCCCCGC GCGAGGATGG CCTTTCGATG GAGCCGGGCC TGATCGCGCA GGAGCTGACC GAGCGCACCG GTCAGCCTGC AGCGTTTTTT GGCAAACCCT TTGGCAACGC CTTTGACGCG GCACTCGCGC GGCTCTCTGG CATTGAGCGC ACGCGTATTG CAATGGTCGG CGATACGCTG CACACGGATG TTCTGGGAGG CGCGGCTGCA GGGATCGGCT CCATCCTGAT CACCGATCAC GGCCTTTTTA AGGGCCATGA TGTCGCGCCA TACATCGAAA AGAGCGCAAT TCGACCGAGT TGGATCGTCT CGACAACATA A
|
Protein sequence | MAGQSTDIGT DWAFQRYESV RATLPEASFA AASRRGGDLG DTVGDFDAYI LDAFGVLNRG ETAIAGAVER MAALRALGKR LVVLTNAASY TRAEVLAKYH RLGFDFDASE VVSSRDVAFA GLPALPAGAF WAAAAAAGDD FSDAPSGAEI AHLAERPELL QSAGGFLLLS SARWSAAETD ALTEALLASP RPLVVANPDL VAPREDGLSM EPGLIAQELT ERTGQPAAFF GKPFGNAFDA ALARLSGIER TRIAMVGDTL HTDVLGGAAA GIGSILITDH GLFKGHDVAP YIEKSAIRPS WIVSTT
|
| |
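The gene and protein sequences above can be cross-checked with a short, dependency-free Python sketch. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The sketch relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as starts. It does not convert alternative start codons such as GTG or TTG to methionine (not needed here, since this gene starts with ATG), and it assumes the sequence contains only A, C, G, and T.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, enumerated in TCAG order (standard code,
# which table 11 shares for elongation; '*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Drop the whitespace that splits the record's sequence into 10-base blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
print(translate("ATGGCAGGCC AAAGTACAGA TATCGGCACT"))  # MAGQSTDIGT
```

Passing the full Gene sequence field to `translate` and `gc_fraction` gives values that can be compared against the Protein sequence and GC content fields.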