Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_3652 |
Symbol | |
ID | 5901107 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 3940558 |
End bp | 3941841 |
Gene Length | 1284 bp |
Protein Length | 427 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641564163 |
Product | homogentisate 1,2-dioxygenase |
Protein accession | YP_001685277 |
Protein GI | 167647614 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3508] Homogentisate 1,2-dioxygenase |
TIGRFAM ID | [TIGR01015] homogentisate 1,2-dioxygenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.207248 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACCTCC AGTACCAATC CGGCTTCGCC AACCACTTCA GCACCGAGGC CGTCCCCGGC GCCCTGCCGG TGGGCCAAAA CTCGCCGCAG GCGCCGCCCT ACGGCCTCTA TGCCGAGCAA CTGTCGGGCA CGGCCTTCAC CGCGCCGCGC CACGAGAACC GCAGAAGTTG GCTGTACCGC CTGCGACCCA GCGCCGGCCA TGGACCCTAT GCGCCCTATG TCCAGGAGCG CCTGAAGAGC GGCCCGTTCG GCGCCGCCGT CCCGACGCCC AATCGCCTGC GTTGGGATCC ACTGGAGATC CCCGAAGCGC CGCTCGACTT CGTCGATGGT CTCGTCACCC TGGCCGGCAA CGGCGACGTG GCGACCCAGG CCGGCATGGC CGCGCATCTG TATCTCGCCA ACCGCTCGAT GATCGACCGG GTGTTCCAGA ACGCCGACGG CGAGCTGTTG ATCGTGCCCC AGCTGGGCGC CCTGCGTTTC GTCACCGAGT TGGGCGTGAT CGACGCCGCT CCAGGCGAGG TCGTGGTCAT TCCGCGCGGC GTGCGGTTCC GCGTCGAGCT TGAGGGGCCG GTTCGCGGCT ATGTCTGCGA GAACTATGGC CCCATGTTCC GCCTGCCCGA ACTGGGACCG ATCGGCTCGA ACGGCCTGGC CAACAGCCGC GATTTCCTGA CCCCCGTCGC CGCCTTCGAG GATGTTGAGC GCCCGACCGA GGTGATCCAG AAGTTCCAGG GCGGCCTGTG GACGGGAACC TGGGACCACA GCCCGCTGGA CGTCGTCGCC TGGCACGGCA ATCTGGCGCC CTACAAGTAC GACCTGGCGC GGTTCAATAC GATGGGCACG GTCAGCTTCG ACCATCCCGA TCCGTCGATC TTCACGGTCC TCACCGCGCC CAGCGAGATC CCGGGCACGG CCAATGTCGA TTTCGTGATC TTCCCGCCGC GCTGGATGGT GGCCGAGCAC ACCTTCCGGC CGCCCTGGTT CCACCGCAAC GTGATGAGCG AGTTCATGGG GCTGGTCACC GGCGCCTACG ACGCCAAGGC TGGCGGCTTC AGTCCGGGCG GGGCTTCCCT GCACAACATG ATGAGCGACC ACGGTCCGGA CGTGGCCAGC CACAAGGCCG CCAGCGAGGC CGATTTGAGT CCGCACAAGA TCGAGGCGAC CATGGCTTTC ATGTTCGAGA GCCGCTGGGT GATCCGTCCC ACGAAATACG CTCTGGAGAC TTCTGAACTT CAGGCCGACT ATGACGCGTG CTGGACGGGC TTTCCCAAGG CCAAGCTGCC TTAG
|
Protein sequence | MDLQYQSGFA NHFSTEAVPG ALPVGQNSPQ APPYGLYAEQ LSGTAFTAPR HENRRSWLYR LRPSAGHGPY APYVQERLKS GPFGAAVPTP NRLRWDPLEI PEAPLDFVDG LVTLAGNGDV ATQAGMAAHL YLANRSMIDR VFQNADGELL IVPQLGALRF VTELGVIDAA PGEVVVIPRG VRFRVELEGP VRGYVCENYG PMFRLPELGP IGSNGLANSR DFLTPVAAFE DVERPTEVIQ KFQGGLWTGT WDHSPLDVVA WHGNLAPYKY DLARFNTMGT VSFDHPDPSI FTVLTAPSEI PGTANVDFVI FPPRWMVAEH TFRPPWFHRN VMSEFMGLVT GAYDAKAGGF SPGGASLHNM MSDHGPDVAS HKAASEADLS PHKIEATMAF MFESRWVIRP TKYALETSEL QADYDACWTG FPKAKLP
|
| |