Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_1237 |
Symbol | |
ID | 4076352 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | - |
Start bp | 1333106 |
End bp | 1334272 |
Gene Length | 1167 bp |
Protein Length | 388 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 638006545 |
Product | HD domain-containing protein |
Protein accession | YP_613232 |
Protein GI | 99081078 |
COG category | [R] General function prediction only |
COG ID | [COG1896] Predicted hydrolases of HD superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCACAG ATCTTGAGCA GCAATTCGAA TTCCTCACTG AAATCGAAAG GCTGCGCGAG GTGGAACGGC AGAACCTTCT GCTGGACGGC AGCCGGGTAG AAAATTCGGC CGAGCACAGC TGGCATCTGG CGCTCTATGC GCTGGTGTTT GCCCCCTATG CGCCCTCCGA CGTGTCCATA ACCCGTGTCA TCGAGATGCT CTTGCTGCAT GACATCGTGG AGATCGACGT TGGCGATCAT CCGATTGATG AGCCAACAGA CTGGGAGGCG GTGGCACAAG CCGAAGACCG CGCGCAGCGA CGGATATTCG GACTGCTTCC AGAAGCGCAG GGCCACCGGC TGCAGGCGCT CTGGCAGGAA TTTGAAGCGG CGCATACAGC AGATGCGCGC TTTGCAAAAT CGCTGGACTA CTGCCAGCCG ATCTTCCAGA CGCTTTGCGC TGTTTCGCCT CCCGCCGATC ACCTTCGGGT GGTACGCGAA AACCTGACCA CCGGTCGCGC CACCTCTCTT CGAGAGCGGT TTCCCGAGGC TTATGCAGCA GCATGCAGCC TCATTGACGG TCAGACCGTC AGCGATCCGG ACTTTGCAGC ACGGCTCGCG TTTCTGTCCG AAGCTGACCG GTTGAAGTCG GTTCTACGCG CCTCGCGGAT TGCCTCCGGC ACCCGATATG AAAACTCGGC AGAACACAGC TGGCACATCA TGCTCTATGG CTGGATCCTC GCTCCGCATA GCCTGTCGGA AGTCGACGTC TCGCGCGTTC TCAAGATGCT GCTACTGCAC GATCTGGTCG AGATTGACGC CGGCGATGTG CCCATTCACT CCAATCTGGA CGCCGCCGCG CTGCGCCAGA TCGAAGAGAC TGAGAAAGCC GCCGCAGAGC GGATCTTTGG GCTGTTGCCG GACGCGCAGG CCAAGGACTG CCTCATGATC TGGCAGGAAT TCGAAGCCGC CCAGAGCGCG GATGCGGTCT TTGCCAAATC CATCGACCGC GTGCAGCCGG TCTTGTTGAA TATTGCCACC GGCGGTGGCA GCTGGGTGGC CTATGATGTC ACCCTACCGC AGCTGGAAAC CCGCGTGGGC GTGAAAATTG CGCGGGGCGC ACCGAAGGTC TGGGACCATG TGCGTGCGCT TCTGTTGCCC TGGTTTACGG CACAAGGCCG CCTCTGA
|
Protein sequence | MTTDLEQQFE FLTEIERLRE VERQNLLLDG SRVENSAEHS WHLALYALVF APYAPSDVSI TRVIEMLLLH DIVEIDVGDH PIDEPTDWEA VAQAEDRAQR RIFGLLPEAQ GHRLQALWQE FEAAHTADAR FAKSLDYCQP IFQTLCAVSP PADHLRVVRE NLTTGRATSL RERFPEAYAA ACSLIDGQTV SDPDFAARLA FLSEADRLKS VLRASRIASG TRYENSAEHS WHIMLYGWIL APHSLSEVDV SRVLKMLLLH DLVEIDAGDV PIHSNLDAAA LRQIEETEKA AAERIFGLLP DAQAKDCLMI WQEFEAAQSA DAVFAKSIDR VQPVLLNIAT GGGSWVAYDV TLPQLETRVG VKIARGAPKV WDHVRALLLP WFTAQGRL
|
| |