Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_3853 |
Symbol | |
ID | 5901315 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 4168341 |
End bp | 4169531 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641564375 |
Product | 3-oxoadipate enol-lactonase |
Protein accession | YP_001685477 |
Protein GI | 167647814 |
COG category | [R] General function prediction only [S] Function unknown |
COG ID | [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [COG0599] Uncharacterized homolog of gamma-carboxymuconolactone decarboxylase subunit |
TIGRFAM ID | [TIGR02425] 4-carboxymuconolactone decarboxylase [TIGR02427] 3-oxoadipate enol-lactonase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.126877 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.866495 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCTTCG CCACCCACGA CGGCGCGCGG ATCTACTGGC GGCTGGACGG CGCGGCCGAC AAGCCCGCCC TGGTGCTGCT GAACTCGATC GGCACGGACA TGGGGCTGTA CGACGCCGCC GCCCCGCTGC TGCTGGCCGA CTTCCGCCTG CTGCGCATCG ACACGCGCGG CCATGGCGCC TCGGACGCCC CGGCCGTCGA CTACACGCTG GACCAGTTGG CGGGCGACGC GCTGGCGGCG ATGGACGCGG CGGGCCTGGC CACGGCCAGC GTCGTCGGCG TCTCGCTGGG CGGCATGGTC GCCATGGCCC TGGCCCTGAA GGCGCCCGAG CGGGTCGAGG GGCTGGTCCT GGCCTGCACC TCGGCGGCCA TGGACGTCGC CGCCTGGACC GCCCGCATCG CCACCGTCCG CGCCGAAGGC ATGGCCGCCA TCGCCGAGAT GGCCCTGGGC CGGTTCTTCT CCGAGCCCTT CCGCGGCCAG CATCCCGCCA CGGTCGAGAC CGTGCGCGCC GGTCTGCTGG CCATGAGCCC CGATGGCTAC AGCGGCTGCG GAGCCGCGAT CCGCGACATG GACCTGCTGG CGCGGATCTC CGCCATCACC GCCCCCACCC TGGTGATCGG CGGACGCAAG GACGTCTCGA CGCCGTTCGA GGGGAATGGC GACCGGATCG TGGCGGCCAT CCCGGGCGCG ACCTCGGCCA TGCTCGACAC CGCTCACCTG CCCAGCCTGG AAGACCCCAC CGCCTTCGCC GGCGCCGTGC GCAGCTTTCT GGCCAGCACC CGCGACGGTT CGGGCGTCAG CGCGGCGGCG GACGTGCTGT TCGAGGCCGG CCTCGTCCAT CGTCGCAAGG TGCTGGGCGA CGCCTGGGTC GACCGTTCGC TGGCCAAGCG CACCCCCTTC ACCGCCGACT ACCAGGCGAT GATCACCCGC TATGCCTGGA ACGAGATCTG GGGCCGGCCG GGCCTGGACC ATCGCACGCG TCGCCTGCTG GTGCTGGCCA TCTGCGCCTC GCTGGCCCGC TGGGAAGAGT TTCGCCTCCA CGTCCGCGCC GGCCTGGAAC AGGGCGGCTT CACCCAGGAC GAGCTCAAGG AGGTGCTGAT GCAGACGGCG ATCTATGCGG GGGTGCCCGC CGCCAACACC GCCTTCACGG AGGCCGCCGA AATCATCGCC GAGTTGGGCG GGGCTGATTG A
|
Protein sequence | MAFATHDGAR IYWRLDGAAD KPALVLLNSI GTDMGLYDAA APLLLADFRL LRIDTRGHGA SDAPAVDYTL DQLAGDALAA MDAAGLATAS VVGVSLGGMV AMALALKAPE RVEGLVLACT SAAMDVAAWT ARIATVRAEG MAAIAEMALG RFFSEPFRGQ HPATVETVRA GLLAMSPDGY SGCGAAIRDM DLLARISAIT APTLVIGGRK DVSTPFEGNG DRIVAAIPGA TSAMLDTAHL PSLEDPTAFA GAVRSFLAST RDGSGVSAAA DVLFEAGLVH RRKVLGDAWV DRSLAKRTPF TADYQAMITR YAWNEIWGRP GLDHRTRRLL VLAICASLAR WEEFRLHVRA GLEQGGFTQD ELKEVLMQTA IYAGVPAANT AFTEAAEIIA ELGGAD
|
| |