Gene TM1040_0854 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_0854 
Symbol 
ID4076029 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp906573 
End bp907493 
Gene Length921 bp 
Protein Length306 aa 
Translation table11 
GC content62% 
IMG OID638006152 
Producthaloacid dehalogenase-like hydrolase 
Protein accessionYP_612849 
Protein GI99080695 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0647] Predicted sugar phosphatases of the HAD superfamily 
TIGRFAM ID[TIGR01459] HAD-superfamily class IIA hydrolase, TIGR01459
[TIGR01460] Haloacid Dehalogenase Superfamily Class (subfamily) IIA 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.770562 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGGCC AAAGTACAGA TATCGGCACT GACTGGGCGT TTCAGCGCTA TGAGAGCGTG 
CGTGCAACCC TGCCCGAGGC CAGTTTTGCG GCGGCCTCAC GGCGGGGAGG AGATCTTGGC
GACACCGTCG GAGATTTTGA CGCCTATATC CTCGATGCCT TTGGGGTTTT GAACCGGGGC
GAGACCGCAA TTGCGGGCGC GGTGGAGCGC ATGGCAGCAC TCAGGGCGCT TGGTAAAAGG
CTGGTGGTGC TGACCAATGC GGCAAGCTAC ACGCGCGCAG AGGTGCTGGC GAAATATCAC
CGGCTTGGCT TTGACTTCGA CGCGTCAGAA GTGGTCTCAA GCCGCGATGT GGCCTTTGCC
GGTCTGCCCG CACTCCCGGC CGGCGCATTT TGGGCCGCCG CTGCCGCAGC AGGTGATGAT
TTCAGTGATG CCCCCAGCGG CGCTGAAATC GCGCATCTGG CAGAGCGGCC GGAGCTCTTG
CAAAGCGCGG GTGGCTTTCT GCTGCTGTCC TCTGCACGCT GGAGCGCGGC CGAAACAGAC
GCGCTCACCG AGGCATTGTT GGCGTCTCCG CGTCCTCTGG TGGTCGCAAA CCCCGATCTC
GTCGCCCCGC GCGAGGATGG CCTTTCGATG GAGCCGGGCC TGATCGCGCA GGAGCTGACC
GAGCGCACCG GTCAGCCTGC AGCGTTTTTT GGCAAACCCT TTGGCAACGC CTTTGACGCG
GCACTCGCGC GGCTCTCTGG CATTGAGCGC ACGCGTATTG CAATGGTCGG CGATACGCTG
CACACGGATG TTCTGGGAGG CGCGGCTGCA GGGATCGGCT CCATCCTGAT CACCGATCAC
GGCCTTTTTA AGGGCCATGA TGTCGCGCCA TACATCGAAA AGAGCGCAAT TCGACCGAGT
TGGATCGTCT CGACAACATA A
 
Protein sequence
MAGQSTDIGT DWAFQRYESV RATLPEASFA AASRRGGDLG DTVGDFDAYI LDAFGVLNRG 
ETAIAGAVER MAALRALGKR LVVLTNAASY TRAEVLAKYH RLGFDFDASE VVSSRDVAFA
GLPALPAGAF WAAAAAAGDD FSDAPSGAEI AHLAERPELL QSAGGFLLLS SARWSAAETD
ALTEALLASP RPLVVANPDL VAPREDGLSM EPGLIAQELT ERTGQPAAFF GKPFGNAFDA
ALARLSGIER TRIAMVGDTL HTDVLGGAAA GIGSILITDH GLFKGHDVAP YIEKSAIRPS
WIVSTT