Gene TM1040_1237 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_1237 
Symbol 
ID4076352 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp1333106 
End bp1334272 
Gene Length1167 bp 
Protein Length388 aa 
Translation table11 
GC content60% 
IMG OID638006545 
ProductHD domain-containing protein 
Protein accessionYP_613232 
Protein GI99081078 
COG category[R] General function prediction only 
COG ID[COG1896] Predicted hydrolases of HD superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCACAG ATCTTGAGCA GCAATTCGAA TTCCTCACTG AAATCGAAAG GCTGCGCGAG 
GTGGAACGGC AGAACCTTCT GCTGGACGGC AGCCGGGTAG AAAATTCGGC CGAGCACAGC
TGGCATCTGG CGCTCTATGC GCTGGTGTTT GCCCCCTATG CGCCCTCCGA CGTGTCCATA
ACCCGTGTCA TCGAGATGCT CTTGCTGCAT GACATCGTGG AGATCGACGT TGGCGATCAT
CCGATTGATG AGCCAACAGA CTGGGAGGCG GTGGCACAAG CCGAAGACCG CGCGCAGCGA
CGGATATTCG GACTGCTTCC AGAAGCGCAG GGCCACCGGC TGCAGGCGCT CTGGCAGGAA
TTTGAAGCGG CGCATACAGC AGATGCGCGC TTTGCAAAAT CGCTGGACTA CTGCCAGCCG
ATCTTCCAGA CGCTTTGCGC TGTTTCGCCT CCCGCCGATC ACCTTCGGGT GGTACGCGAA
AACCTGACCA CCGGTCGCGC CACCTCTCTT CGAGAGCGGT TTCCCGAGGC TTATGCAGCA
GCATGCAGCC TCATTGACGG TCAGACCGTC AGCGATCCGG ACTTTGCAGC ACGGCTCGCG
TTTCTGTCCG AAGCTGACCG GTTGAAGTCG GTTCTACGCG CCTCGCGGAT TGCCTCCGGC
ACCCGATATG AAAACTCGGC AGAACACAGC TGGCACATCA TGCTCTATGG CTGGATCCTC
GCTCCGCATA GCCTGTCGGA AGTCGACGTC TCGCGCGTTC TCAAGATGCT GCTACTGCAC
GATCTGGTCG AGATTGACGC CGGCGATGTG CCCATTCACT CCAATCTGGA CGCCGCCGCG
CTGCGCCAGA TCGAAGAGAC TGAGAAAGCC GCCGCAGAGC GGATCTTTGG GCTGTTGCCG
GACGCGCAGG CCAAGGACTG CCTCATGATC TGGCAGGAAT TCGAAGCCGC CCAGAGCGCG
GATGCGGTCT TTGCCAAATC CATCGACCGC GTGCAGCCGG TCTTGTTGAA TATTGCCACC
GGCGGTGGCA GCTGGGTGGC CTATGATGTC ACCCTACCGC AGCTGGAAAC CCGCGTGGGC
GTGAAAATTG CGCGGGGCGC ACCGAAGGTC TGGGACCATG TGCGTGCGCT TCTGTTGCCC
TGGTTTACGG CACAAGGCCG CCTCTGA
 
Protein sequence
MTTDLEQQFE FLTEIERLRE VERQNLLLDG SRVENSAEHS WHLALYALVF APYAPSDVSI 
TRVIEMLLLH DIVEIDVGDH PIDEPTDWEA VAQAEDRAQR RIFGLLPEAQ GHRLQALWQE
FEAAHTADAR FAKSLDYCQP IFQTLCAVSP PADHLRVVRE NLTTGRATSL RERFPEAYAA
ACSLIDGQTV SDPDFAARLA FLSEADRLKS VLRASRIASG TRYENSAEHS WHIMLYGWIL
APHSLSEVDV SRVLKMLLLH DLVEIDAGDV PIHSNLDAAA LRQIEETEKA AAERIFGLLP
DAQAKDCLMI WQEFEAAQSA DAVFAKSIDR VQPVLLNIAT GGGSWVAYDV TLPQLETRVG
VKIARGAPKV WDHVRALLLP WFTAQGRL