Gene Information |
Locus tag | Cwoe_5542 |
Symbol | |
ID | 8736017 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | - |
Start bp | 5935322 |
End bp | 5936002 |
Gene Length | 681 bp |
Protein Length | 226 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 646506172 |
Product | HAD-superfamily hydrolase, subfamily IA, variant 2 (HAD-like) |
Protein accession | YP_003397322 |
Protein GI | 284046982 |
COG category | [R] General function prediction only |
COG ID | [COG1011] Predicted hydrolase (HAD superfamily) |
TIGRFAM ID | [TIGR01428] 2-haloalkanoic acid dehalogenase, type II; [TIGR01493] Haloacid dehalogenase superfamily, subfamily IA, variant 2 with 3rd motif like haloacid dehalogenase; [TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E |
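
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length includes one terminal stop codon. The function names are hypothetical and not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of residues for a CDS whose length includes one stop codon (assumption)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the record above.
start_bp, end_bp = 5935322, 5936002
gene_length_bp, protein_length_aa = 681, 226

print(span_length(start_bp, end_bp) == gene_length_bp)                 # True
print(expected_protein_length(gene_length_bp) == protein_length_aa)    # True
```

Under these assumptions, 5936002 - 5935322 + 1 = 681 bp, and 681 / 3 = 227 codons, one of which is the stop codon, leaving 226 aa.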
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.350775 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGCAATTC CCAGAAACGT CACCTTCGTC ACCTTCGACG TCTACGGCAC CCTCATCGAC TGGGATTCCG GGGCATACGA CGCCTTCGCC AAGGAGGCGA GACGCGACGG CTTCACGATC GATCGCGACG AGCTGATCCC GCTGTTCCAC TCGATCCAGC AGGAGATCCA GGCCGGCTCC TACGAGCTCT ACGCCGAGGT CCTGCGCCGC ACGGCCGTCC GCATCTCCGA GGAGATGAGA TGGGGCCTGG AGCCGTCGCG CTCGGGCTTC CTGCCCGACT CGGTCCAGCG CTGGCCCGCG TTCAGAGAGG CCAACCCGAC GCTGGCGAGA TTCGCCAGAC AGTTCAGAAC CGGGCTGATC TCGAACATCG ACGACAAGCT GCTCGGTCAG ACGCGGCGCC ACATCCCGCT CGACTTCGAC CTCGTCGTCA CCGCCCAGCA GGTGCGCTCC TACAAGCCCG ACCCGGCGCA CTTCAAGGAG TGCGAGCGGC GCGTCGGCGG CAAGAGAGGC TGGGTGCACG TCTCCTCCAG CTATCCGACG GACGTCGAGC CCTGCCTCAG AGCGAGAGTG CCGGTCATCT GGGTCAACCG CGACAGAGCG ACGCTCGAGA GCGGCCAGAA GAAGCCCGAC GCGGAGGTCA CGAACCTGCG CGAGGCGCTG AGACTGCTCG CCGGCGACTG A
Protein sequence | MAIPRNVTFV TFDVYGTLID WDSGAYDAFA KEARRDGFTI DRDELIPLFH SIQQEIQAGS YELYAEVLRR TAVRISEEMR WGLEPSRSGF LPDSVQRWPA FREANPTLAR FARQFRTGLI SNIDDKLLGQ TRRHIPLDFD LVVTAQQVRS YKPDPAHFKE CERRVGGKRG WVHVSSSYPT DVEPCLRARV PVIWVNRDRA TLESGQKKPD AEVTNLREAL RLLAGD
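
The gene sequence is shown in 10-base groups and already reads in the coding orientation (it begins with ATG even though the gene lies on the minus strand). A minimal sketch of how the GC content and protein sequence fields could be recomputed from it is given below. The helper names are hypothetical. The codon table is the standard amino-acid assignment, which translation table 11 shares with table 1 (the two differ only in permitted start codons).

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same
# amino-acid mapping as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(field: str) -> str:
    """Remove the spaces that separate the 10-base groups and upper-case the result."""
    return "".join(field.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a coding-strand sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three groups of the gene sequence field above.
fragment = clean("ATGGCAATTC CCAGAAACGT CACCTTCGTC")
print(translate(fragment))  # MAIPRNVTFV
```

Run on the full gene sequence field, the same functions give values that can be compared against the reported protein length (226 aa) and GC content (66%).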