Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_3653 |
Symbol | |
ID | 5901108 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 3941996 |
End bp | 3943057 |
Gene Length | 1062 bp |
Protein Length | 353 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641564164 |
Product | 4-hydroxyphenylpyruvate dioxygenase |
Protein accession | YP_001685278 |
Protein GI | 167647615 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins |
TIGRFAM ID | [TIGR01263] 4-hydroxyphenylpyruvate dioxygenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.117018 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGTCA ATGCGCAAAA TCCCCTCGGC CTCGACGGCT TCGAGTTCGT CGAGTTCACC AGCCCCGATC CAGCCGCCAT GAAGGCCCTG TTCGAACAGC TGGGCTTCGT CGCCGCCAGC CAGCATCCGA CCAAGGCCGT GACCCGCTAC AAGCAGGGCC GTATCAACCT GCTGGTCAAT GAAGAGACGT CCGGCCAGGT CGCCGCGTTC CGCGCCGCCC ACGGCCCCTC GGCCAACGGC ATGGCCTTCC GGGTCGAGAA CGTCGATCAG GCCTATGCCG AGGCCCTCAA GCGCGGCGCC GTCGCGGCGG ACGCCGGCAA GACCGTGCTG GGCGAGGGCG CCAAGGTGCT GGAAGGCATC GGCGGTTCGA TGCTGTACCT CGTCCCGGCC GAGGGCTCGG TCTATGACAG CTGGACCCCG GTCCCCGGCG CGGCGGAAGC CGAAGCGGCC AACAACGTCG GCCTCGACCT GCTCGACCAC CTGACCCACA ACGTCAAGCG CGGCCAGATG CGCACCTGGT CGACCTTCTA TCGCGACGTC TTCGGCTTCG AGGAGCAGAA GTATTTCGAC ATCAAGGGCC AGGCCACCGG CCTGTTCAGC CAGGCGATGA TCGCGCCAGA CAAGGCCATC CGCATCCCGC TGAACGAGAG CCAGGACGAC CACAGTCAGA TCGAGGAGTT CCTCCGCCAG TACAACGGCG AAGGCATCCA GCACCTGGCC CTGACCACGC CCGACATCTA CGACACCGTC GAGAAGCTGC GCGCCCGGGG CGTCAAGCTG CAGGACACCA TCGAGACCTA TTACGAGCTG GTCGACAAGC GCGTGCCAGG CCACGGCGAG GACCTGGAGC GCCTGAGGAA GAACCGCATC CTGCTGGACG GCAAGGTCGG CGAGGAAGGC CTGCTGTTGC AGATCTTCAC CGAGAACCTG TTTGGGCCGA TCTTCTTCGA GATCATCCAG CGCAAGGGCA ATGAAGGCTT CGGCAACGGC AACTTCCAGG CTCTGTTCGA GAGCATCGAG CTGGATCAGA TCCGCCGCGG CGTGATCACG GTCGAGGCCT AG
|
Protein sequence | MTVNAQNPLG LDGFEFVEFT SPDPAAMKAL FEQLGFVAAS QHPTKAVTRY KQGRINLLVN EETSGQVAAF RAAHGPSANG MAFRVENVDQ AYAEALKRGA VAADAGKTVL GEGAKVLEGI GGSMLYLVPA EGSVYDSWTP VPGAAEAEAA NNVGLDLLDH LTHNVKRGQM RTWSTFYRDV FGFEEQKYFD IKGQATGLFS QAMIAPDKAI RIPLNESQDD HSQIEEFLRQ YNGEGIQHLA LTTPDIYDTV EKLRARGVKL QDTIETYYEL VDKRVPGHGE DLERLRKNRI LLDGKVGEEG LLLQIFTENL FGPIFFEIIQ RKGNEGFGNG NFQALFESIE LDQIRRGVIT VEA
|
| |