Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_0412 |
Symbol | |
ID | 5897686 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 451568 |
End bp | 452893 |
Gene Length | 1326 bp |
Protein Length | 441 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641560898 |
Product | fumarylacetoacetase |
Protein accession | YP_001682047 |
Protein GI | 167644384 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0179] 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway) |
TIGRFAM ID | [TIGR01266] fumarylacetoacetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 0.784977 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCAAGC CTCCCATTGA TGAGACGCAC GATCCTTCAC GCCGCAGTTG GGTGAGCTCG GCCCAGGGCT CGGCGTTCCC GATCCAGAAC CTGCCGCTTG GCGTGTTCAG TCCCGCCGGC GGACTTCCGC GCGGCGGGGT GGCGATCGGT GATCAGATCC TGGACCTGCT GGACGTCAGC GAGTCGGGCT ATTTCGAGGG CGAGGCGCGC AGCGCCGCCG AGGCGGCGGC CCGACCCGAT CTGACCAGCT ATCTGGGCCT GAAAGCCTCG GCGAGGCTGG CCTTGCGGCG GCGGCTTTCC CAGCTGCTGT CAGATCCCAC GCATCAGGCG GCCTTAACGC CCTATCTGCA CGCGGCCAGC GATTGCGCGC TGCTTATGCC GGTCCGCGTG GCCAACTACA CCGACTTCTT CGCCGGGGCC CACCACGCCG TGAACGCGGG CAAGATGCTC CGCCCCGACG CGCCCTTGTC GCCCAACTAC AAGCACGTGC CGATCGCCTA TCACGGACGC GCCTCATCCA TTCGGCCCAG TGGAACCCCG GTGCGTCGTC CTTGGGGGCA AGTCCTGCCC GGCGGCGCCC AAGCCCCTGA ACTTAGGCAG ACCCGGCGAT TGGACTACGA GTTCGAGCTG GGCCTTTGGG TGGGCGCGGG CAACGCCCTG GGTGAGCCGC TCCCAATCGC CCAGGCCGGG GAGCATCTGG TCGGGCTGTG CATTCTCAAC GACTTCTCCG CGCGGGACCT GCAAGCTTGG GAGGCTCAGC CGCTAGGGCC GTTCCTCAGC AAGAATTTCC TTAGCGTGAT CTCCCCCTGG ATCGTCACCG TCGAAGCCCT GGCGCCGTTC CGCGCCGCGC AGCGCGCAAG GCCGCAGGAG GATCCCCCGC CGCTCGCCTA CCTCTGGGAC GAGGCTGATC AAGCCGGCGG CGCCTTGGCC ATCGCCCTGG AGGCGTTCCT GTCGACCTCG GCCATGCGCC AGGCGGGCCT AGCGCCCGAT CGCATCGGCC TGGGACATGC CGCCCACCTC TATTGGACGC CGGCCCAGAT GATCGCCCAT CACACCGTGG GCGGCTGTGA TCTGGGCCCT GGGGACCTGC TGGGCACGGG AACGATATCG GCGCCGGACG CGCCGGACGT CTCGGGGGCC GGCTGTCTGC TGGAAATGAC AAAGGCGGGG CGTCAGCCCA TCACGCTCTC GAGCGGCGAG ACGCGCGGCT TCCTGGCGGA CGGTGACGAA ATCCTGCTGC GCGCGACCGC TCGGGCGGCG GGATTTGTCG ATATCGGATT TGGTGAGTGC CGCGCGCAGG TGGCGCCGGC CAATGAGGAG GCGTGA
|
Protein sequence | MLKPPIDETH DPSRRSWVSS AQGSAFPIQN LPLGVFSPAG GLPRGGVAIG DQILDLLDVS ESGYFEGEAR SAAEAAARPD LTSYLGLKAS ARLALRRRLS QLLSDPTHQA ALTPYLHAAS DCALLMPVRV ANYTDFFAGA HHAVNAGKML RPDAPLSPNY KHVPIAYHGR ASSIRPSGTP VRRPWGQVLP GGAQAPELRQ TRRLDYEFEL GLWVGAGNAL GEPLPIAQAG EHLVGLCILN DFSARDLQAW EAQPLGPFLS KNFLSVISPW IVTVEALAPF RAAQRARPQE DPPPLAYLWD EADQAGGALA IALEAFLSTS AMRQAGLAPD RIGLGHAAHL YWTPAQMIAH HTVGGCDLGP GDLLGTGTIS APDAPDVSGA GCLLEMTKAG RQPITLSSGE TRGFLADGDE ILLRATARAA GFVDIGFGEC RAQVAPANEE A
|
| |