Gene Information |
Locus tag | Caul_5251 |
Symbol | |
ID | 5897263 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010335 |
Strand | + |
Start bp | 183881 |
End bp | 184792 |
Gene Length | 912 bp |
Protein Length | 303 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641555354 |
Product | 5-oxopent-3-ene-1,2,5-tricarboxylate decarboxylase |
Protein accession | YP_001676685 |
Protein GI | 167621900 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0179] 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway) |
TIGRFAM ID | |
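The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced with a single terminal stop codon (the record lists "Is gene spliced | No"). The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table for Caul_5251.
START_BP = 183881
END_BP = 184792
GENE_LENGTH_BP = 912
PROTEIN_LENGTH_AA = 303


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one stop codon.

    Assumes an unspliced CDS whose length is a multiple of three and
    which ends in exactly one stop codon.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should hold if the record is internally consistent.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 184792 - 183881 + 1 gives 912 bp, and 912 / 3 - 1 gives 303 aa, matching the listed gene and protein lengths.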
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.62648 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCCTGA TCACCTACCG CGCCCAGATC GACGACCGCG CCCGTCTGGG GGCGATCGTG GGCGACCTTG TCGTCGACCT GCAACGGCTT GGCGCGTCCC ACGGCCTTGT CGTGCCCGGC GCCATGCTGG ACCTGATCGA CCTGGGCCCC TCGGGCCTGG CCAGCGTCGC CAAGCTGCTG GCCGAACATC AGGGGACCTG GCCGCTCGGC GTCGCGATCG CCTTGGCCAA CGTCACGCTG CTGGCCCCCA TCCCGCGCCC TCGCAAGAAC ATCTTCGGCA TCGGGCTCAA CTACCTCGAT CACGTCGCCG AGTCGGCGCA GGCGCTGGAC ACCTCGCCCG ACCTGCCCAG GCAACCGGTG GTCTTTTCAA AGCCCCCGAC TACGATCATT GGGCCGGATG CGGCGATACC CCACAACGCC AAGATCACCC AGCAGATGGA TTGGGAGGTC GAACTGGCGG TCGTGATCGG CTCCACGGCC CGGCGGATCG ATCGGGCGCG CGCCCTGGAC CATGTGTTCG GCTACACGGT CATGATCGAC ATCAGCGCGC GCGATTGCCG TCGGGCGGGC CAGTGGATCT ACTCCAAGGG CCAGGACGGC TATGGACCGA TGGGACCGAT GATCGTCACC GCCGACGAAG TTCCCGATCC GCAGAACCTG GACCTGTGGC TAAGCGTCAA CGGCGTCGAG AAGCAGCGCT CCAACACCAG TCACATGCTG TTCAAGGTCG ACGAACTGAT CGCCGACCTC AGCCAGGGCA TCACGCTCGA ACCCGGCGAC ATCATCGCCA CGGGGACGCC GGACGGCGTC GGCGCCGGCC GTACGCCGCA GGAATGGCTT TGGCCGGGCG ACGTCGTGGA GGCTCATGTC GCCACCATCG GTGCGCTGCG CAACGTCGTG ATCGCAGCGT GA
|
Protein sequence | MRLITYRAQI DDRARLGAIV GDLVVDLQRL GASHGLVVPG AMLDLIDLGP SGLASVAKLL AEHQGTWPLG VAIALANVTL LAPIPRPRKN IFGIGLNYLD HVAESAQALD TSPDLPRQPV VFSKPPTTII GPDAAIPHNA KITQQMDWEV ELAVVIGSTA RRIDRARALD HVFGYTVMID ISARDCRRAG QWIYSKGQDG YGPMGPMIVT ADEVPDPQNL DLWLSVNGVE KQRSNTSHML FKVDELIADL SQGITLEPGD IIATGTPDGV GAGRTPQEWL WPGDVVEAHV ATIGALRNVV IAA
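The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any calculation. The following is a minimal sketch, using only the Python standard library, of how the grouped gene sequence could be cleaned and its GC percentage computed for comparison with the "GC content" field. The helper names are hypothetical, and the completeness check is a simplification: it only accepts an ATG start and the three standard stop codons, and does not handle the alternative start codons that translation table 11 permits.

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace used for 10-character grouping and upper-case."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


def looks_like_complete_cds(raw: str) -> bool:
    """Simplified check: ATG start, whole codons, standard stop codon at the end."""
    seq = clean_sequence(raw)
    return (
        len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in {"TAA", "TAG", "TGA"}
    )


if __name__ == "__main__":
    # First three blocks of the gene sequence from the record, for illustration.
    # Paste the full "Gene sequence" field here to check the whole gene.
    prefix = "ATGCGCCTGA TCACCTACCG CGCCCAGATC"
    print(len(clean_sequence(prefix)))
    print(round(gc_percent(prefix), 1))
```

Passing the full "Gene sequence" field to `gc_percent` and rounding the result gives a value that can be compared against the listed GC content.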