Gene TM1040_0589 details


Gene Information

Locus tag: TM1040_0589
Symbol:
ID: 4078627
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 629035
End bp: 629910
Gene Length: 876 bp
Protein Length: 291 aa
Translation table: 11
GC content: 60%
IMG OID: 638005886
Product: HAD family hydrolase
Protein accession: YP_612584
Protein GI: 99080430
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0647] Predicted sugar phosphatases of the HAD superfamily
TIGRFAM ID: [TIGR01459] HAD-superfamily class IIA hydrolase, TIGR01459
            [TIGR01460] Haloacid Dehalogenase Superfamily Class (subfamily) IIA
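
The start, end, gene-length and protein-length fields in this record agree with each other, and a short sketch can check that. It assumes the coordinates are 1-based and inclusive, and that the last codon of the CDS is a stop codon. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(629035, 629910)
print(length, protein_length(length))  # 876 291
```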


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00195304
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAGCCAGA TCATCTCCGC CCTTTCCGAG GTTTCAGACC GATACAAGGC TCTGTTCGTG 
GATCTCTGGG GCTGCGTTCA CAACGGGATC ACGGCCTACC CGGACGCTGT CGCGGCCCTT
CAGGCCTACC GCAAATCCGG CGGAGTGGTG GTACTGGTCA CCAACTCGCC CAAACCCCGC
GCAGGGGTGG CAGAGCAACT GAGCCAGTTC GGCGTCCCAG ATGACGCCTA TGACACCATC
GCTACCTCTG GCGATTCCGC GCGCGCGGCG ATGTTCACAG GTGCTGTCGG TGAAAAGGTC
TACTTCATGG GCGAATGGGA GCGCGATGCC GGCTTTTTTG AGCCAATGAA GGTTATCCAC
GAGCCCATTG AGATCACCCG CGTCCCGCTT AAGGAAGCCG AAGGGATCGT CTGCTGCGGT
CCCTTCGACA CCTTGGCCGA CCCGGAGGTG AACCGCGCCG ATTTTCTCTA TGCCAAACAG
ATGGGTATGA AGCTTCTTTG CGCGAATCCG GATATTATCG TCGACCGTGG CGAGGTACGC
GAATGGTGCG CTGGGGCCTT GGCCAAACTT TACACCGAAA TGGGGGGCGA GAGCCTCTAT
TTCGGCAAGC CGCATCCGCC GATCTACGAT CTCGCGCGTC GTCGCCTGAC GGAAATTGGC
CACGATGTTT CGGACCGCGA CATTCTCGCA ATTGGCGACG GGCCGCACAC CGACATCTCA
GGCGGCATGG GTGAAGGGGT GGACACGCTG TTCATCACCG GCGGCTTGGC CGCAAAAGAC
ACCCAAACGG CTCATCAGCC CGAGCCTGCT GCACTGGAGG CGTATCTCGC GCAGGAACAG
ATCGCGCCCA CCTACAGCAT CGGCTTTCTG CGCTAA
 
Protein sequence
MSQIISALSE VSDRYKALFV DLWGCVHNGI TAYPDAVAAL QAYRKSGGVV VLVTNSPKPR 
AGVAEQLSQF GVPDDAYDTI ATSGDSARAA MFTGAVGEKV YFMGEWERDA GFFEPMKVIH
EPIEITRVPL KEAEGIVCCG PFDTLADPEV NRADFLYAKQ MGMKLLCANP DIIVDRGEVR
EWCAGALAKL YTEMGGESLY FGKPHPPIYD LARRRLTEIG HDVSDRDILA IGDGPHTDIS
GGMGEGVDTL FITGGLAAKD TQTAHQPEPA ALEAYLAQEQ IAPTYSIGFL R
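
Both sequences are printed in blocks of ten characters. The sketch below is one possible way to work with them: it strips the grouping, computes a GC percentage, and translates the coding sequence. The helper names are hypothetical. The codon table is the standard one, which as far as the amino-acid assignments go is the same as translation table 11; table 11 differs in which codons may act as starts, and the sketch does not model that.

```python
# Standard genetic code, listed in NCBI's T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for block grouping."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        residue = CODON_TABLE[s[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
print(translate("ATGAGCCAGA TCATCTCCGC CCTTTCCGAG"))  # MSQIISALSE
```

Passing the full gene sequence to `translate` and `gc_percent` gives values to compare with the protein sequence and the GC content field in this record.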