Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cwoe_2093 |
Symbol | |
ID | 8732536 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | + |
Start bp | 2195230 |
End bp | 2196411 |
Gene Length | 1182 bp |
Protein Length | 393 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 646502711 |
Product | Homogentisate 1 2-dioxygenase-like protein |
Protein accession | YP_003393893 |
Protein GI | 284043553 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3508] Homogentisate 1,2-dioxygenase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.6516 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.270951 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCATCGA TCACGCGGAA GGGCGAGATC CCGTCGACGC CGCAGGGCTA CGGCGACGGG ACCTACGTCG ACGAGGTCTT CACGCTGGAC GGGTTCTTCG GCGACTGGGC GCACATCTGG CGGCACCGCA ACCCCGCGAC GCCGACGCGC TGGAGCGACG AGCGGATGAT CTACAACGGC CTCGACAGCG GCGCGCTGGA GCCGACGGAC CGCTCCGATC CGCGCGGCAC GCCGATGACG CTGCTGACCG GCCCGGGAGC CAGCGTCTCG CTCTCGCGGC GCACGGCGTC GATGCCGTTC GCCGAGAAGA ACGTCGACGC CAACCAGATC CGCTTCTACC AGCAGGGGAG CTTCCGCCTG GAGACGGAGC TGGGCCCGAT CGAGGTCGAG GCCGGCGACT TCGTCGTCAT CCCGAAGGGG ATGATGTACC GCGAGATCGC GCTCACCGGC GACAACGCGA TCGTCATCTT CGAGGTCGAG CGGTCGATCG CGCTGGCCGA GAAGCTGCAG GACCAGCTCG GCTTCGCCAG CCTCTTCATC GACTACTCGA CGATGGAGCT GCCCGACCCC GCGGCGATCG ACGGCGACGC GTCGGCCGAG ACCGAGGTGC GCGTGAAGTA CGACGGCGAG CACCACTTCG TCACGTACGA CTTCGACCCG CTCTCCGACG TCGTCGGCTG GTCCGGCGAC CCTGTCCTCT ACAAGCTCAA CGTCTGGGAC ATCCCGAGCC TCGGCAGCTC GGTCGGCTTC ACGAGCCCTC CGTCCAACGC CGTCCTCTTC GCCGAGGACA AGTCGTTCTT CTTCAACGTG CTCGCCGCCA AGCCGTTCCC GTCCGAGCCC GCGCCGCGGT CCAGCTACGG CGCCTCCTCG CACATGAACG ACTGCGACGA GGTGTGGCTC AACCATGTCG CGTCGATCGC GCCCGAGACC AACGGGCACA TCTGGCTGTT CCCGCGCACG ATCGCCCACC CCGGTCTCAA GGTCCCGCCG CAGTACCCCG AGAACCCGCC GAAGGCGATC CGCGAGATCA AGATCAACTT CGACACGACC GCGAAGCTGA GCTGGACGCC GGAGGCGAAG GCCGCGCTGC TGCCCGACCC GCTGACGGCG GTCTATACGA GCTTCTACGG CGCGCACGCC GGCGTGTCCG CCGACGAGGC GCTGGAGCAC GTGCGACGCT GA
|
Protein sequence | MASITRKGEI PSTPQGYGDG TYVDEVFTLD GFFGDWAHIW RHRNPATPTR WSDERMIYNG LDSGALEPTD RSDPRGTPMT LLTGPGASVS LSRRTASMPF AEKNVDANQI RFYQQGSFRL ETELGPIEVE AGDFVVIPKG MMYREIALTG DNAIVIFEVE RSIALAEKLQ DQLGFASLFI DYSTMELPDP AAIDGDASAE TEVRVKYDGE HHFVTYDFDP LSDVVGWSGD PVLYKLNVWD IPSLGSSVGF TSPPSNAVLF AEDKSFFFNV LAAKPFPSEP APRSSYGASS HMNDCDEVWL NHVASIAPET NGHIWLFPRT IAHPGLKVPP QYPENPPKAI REIKINFDTT AKLSWTPEAK AALLPDPLTA VYTSFYGAHA GVSADEALEH VRR
|
| |