Gene Cwoe_2093 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_2093 
Symbol 
ID8732536 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp2195230 
End bp2196411 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content68% 
IMG OID646502711 
ProductHomogentisate 1 2-dioxygenase-like protein 
Protein accessionYP_003393893 
Protein GI284043553 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3508] Homogentisate 1,2-dioxygenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.6516 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.270951 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCATCGA TCACGCGGAA GGGCGAGATC CCGTCGACGC CGCAGGGCTA CGGCGACGGG 
ACCTACGTCG ACGAGGTCTT CACGCTGGAC GGGTTCTTCG GCGACTGGGC GCACATCTGG
CGGCACCGCA ACCCCGCGAC GCCGACGCGC TGGAGCGACG AGCGGATGAT CTACAACGGC
CTCGACAGCG GCGCGCTGGA GCCGACGGAC CGCTCCGATC CGCGCGGCAC GCCGATGACG
CTGCTGACCG GCCCGGGAGC CAGCGTCTCG CTCTCGCGGC GCACGGCGTC GATGCCGTTC
GCCGAGAAGA ACGTCGACGC CAACCAGATC CGCTTCTACC AGCAGGGGAG CTTCCGCCTG
GAGACGGAGC TGGGCCCGAT CGAGGTCGAG GCCGGCGACT TCGTCGTCAT CCCGAAGGGG
ATGATGTACC GCGAGATCGC GCTCACCGGC GACAACGCGA TCGTCATCTT CGAGGTCGAG
CGGTCGATCG CGCTGGCCGA GAAGCTGCAG GACCAGCTCG GCTTCGCCAG CCTCTTCATC
GACTACTCGA CGATGGAGCT GCCCGACCCC GCGGCGATCG ACGGCGACGC GTCGGCCGAG
ACCGAGGTGC GCGTGAAGTA CGACGGCGAG CACCACTTCG TCACGTACGA CTTCGACCCG
CTCTCCGACG TCGTCGGCTG GTCCGGCGAC CCTGTCCTCT ACAAGCTCAA CGTCTGGGAC
ATCCCGAGCC TCGGCAGCTC GGTCGGCTTC ACGAGCCCTC CGTCCAACGC CGTCCTCTTC
GCCGAGGACA AGTCGTTCTT CTTCAACGTG CTCGCCGCCA AGCCGTTCCC GTCCGAGCCC
GCGCCGCGGT CCAGCTACGG CGCCTCCTCG CACATGAACG ACTGCGACGA GGTGTGGCTC
AACCATGTCG CGTCGATCGC GCCCGAGACC AACGGGCACA TCTGGCTGTT CCCGCGCACG
ATCGCCCACC CCGGTCTCAA GGTCCCGCCG CAGTACCCCG AGAACCCGCC GAAGGCGATC
CGCGAGATCA AGATCAACTT CGACACGACC GCGAAGCTGA GCTGGACGCC GGAGGCGAAG
GCCGCGCTGC TGCCCGACCC GCTGACGGCG GTCTATACGA GCTTCTACGG CGCGCACGCC
GGCGTGTCCG CCGACGAGGC GCTGGAGCAC GTGCGACGCT GA
 
Protein sequence
MASITRKGEI PSTPQGYGDG TYVDEVFTLD GFFGDWAHIW RHRNPATPTR WSDERMIYNG 
LDSGALEPTD RSDPRGTPMT LLTGPGASVS LSRRTASMPF AEKNVDANQI RFYQQGSFRL
ETELGPIEVE AGDFVVIPKG MMYREIALTG DNAIVIFEVE RSIALAEKLQ DQLGFASLFI
DYSTMELPDP AAIDGDASAE TEVRVKYDGE HHFVTYDFDP LSDVVGWSGD PVLYKLNVWD
IPSLGSSVGF TSPPSNAVLF AEDKSFFFNV LAAKPFPSEP APRSSYGASS HMNDCDEVWL
NHVASIAPET NGHIWLFPRT IAHPGLKVPP QYPENPPKAI REIKINFDTT AKLSWTPEAK
AALLPDPLTA VYTSFYGAHA GVSADEALEH VRR