Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_3651 |
Symbol | |
ID | 5901106 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 3939503 |
End bp | 3940510 |
Gene Length | 1008 bp |
Protein Length | 335 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641564162 |
Product | fumarylacetoacetate (FAA) hydrolase |
Protein accession | YP_001685276 |
Protein GI | 167647613 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0179] 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.188386 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCTCG CGTCTCTGAA GGGCGGCCGC GACGGCCGGC TGGTGATGGT CTCCAACGAC TTGGCCTGGT TCACCGACGC CGGAACGATC GCCCCGACCC TGCAAGCCGC TCTCGACGAC TGGGAGCGCT GCGAGCCGAT GCTGCGCGCC CTGGCCGAGA GCCTCGAGCA CGGCGGCGTG CCGCGCGAGC GCTTCCACGA GCACGAGGCC GCTAGCCCCT TGCCCCGCGC CTATCAGTGG GTCGACGGCA GCGCCTATGT GAACCACGTG CAACTGGTGC GCCAGGCGCG GGGGGCGCAG ATGCCGGAGA GCTTCTGGAC CGATCCCCTG ATGTACCAGG GCGCCTCGGA CGGGTTCCTG GGGCCGCGCG ACGCGATCCC GCTGGCTGAC GAGGCCTGGG GCTGCGACCT GGAGGGCGAG GTGGCCGTGG TCACCAGCGA CGTGCCGCTG GGCGCGAGCC GCGAGGAGGC CCTGGCGGCG ATCCGCCTGG TGATGCTGGT CAACGACGTT TCCCTGCGCG CCCTGATCCC CGCCGAACTG GCCAAGGGTT TCGGCTTCGT GCAGTCCAAG CCGGCCAGCG CCTTCTCGCC GGTCGCCGTC TCGGTCGACG CCCTGGGCGA GGCCTGGAAG GACGGCAAGC TGTCGGGCGC TCTGCTGGTT GAACTGAACG GCGAGGAGTT CGGCAGGGCC GATGCGGGGG TCGACATGAC CTTCGATTTC GGAACCCTGG TGGCCCACGC CGCCAAGACC CGCGCCTTGG CCGCCGGCAC GATCGTCGGC TCGGGCACGG TGAGCAACCG CGACGCGGAC GGCGGTCCCG GCAAGCCGGT GGCCGACGGC GGCCTGGGCT ATTCGTGCCT GGCCGAGCTG CGCACGATCG AGACCTTGCA GACCGGCCAC CCCAAGACGC CGTTCTTGAA GGTCGGCGAC ACCGTCCGCA TCGAGATGCG CGACGCCAAG GGCCATACGA TCTTCGGCGC CATCGAGCAG GCGGTGGCTT CCATTTGA
|
Protein sequence | MKLASLKGGR DGRLVMVSND LAWFTDAGTI APTLQAALDD WERCEPMLRA LAESLEHGGV PRERFHEHEA ASPLPRAYQW VDGSAYVNHV QLVRQARGAQ MPESFWTDPL MYQGASDGFL GPRDAIPLAD EAWGCDLEGE VAVVTSDVPL GASREEALAA IRLVMLVNDV SLRALIPAEL AKGFGFVQSK PASAFSPVAV SVDALGEAWK DGKLSGALLV ELNGEEFGRA DAGVDMTFDF GTLVAHAAKT RALAAGTIVG SGTVSNRDAD GGPGKPVADG GLGYSCLAEL RTIETLQTGH PKTPFLKVGD TVRIEMRDAK GHTIFGAIEQ AVASI
|
| |