Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_2208 |
Symbol | phhA |
ID | 5899663 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 2404955 |
End bp | 2405836 |
Gene Length | 882 bp |
Protein Length | 293 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641562700 |
Product | phenylalanine 4-monooxygenase |
Protein accession | YP_001683834 |
Protein GI | 167646171 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3186] Phenylalanine-4-hydroxylase |
TIGRFAM ID | [TIGR01267] phenylalanine-4-hydroxylase, monomeric form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.0291015 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.429407 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGCAG ACGGTTTTAG TAGCGGCCCG CCTCCCGGCG CGACGGCCGA TTGGACGATC GATCAGGGCT GGGAGGGCTA CTCTCAGGCT GAGCACGATC TCTGGATCAC GCTCTATGAG CGCCAGACCG CCATGCTGCC GGAACGGGCC TGCGACGAAT TCCTGCGCGG GCTCGACGCG CTCGACCTGC ACCGCTCCGG CATTCCCGAC TTCAAGCGGA TCAACGAGGA ACTCCAGCGC CTGACCGGCT GGAGCGTAGT GGCCGTGCCG GGCCTGGTTC CCGACGACGT GTTCTTCGAC CACTTGGCCA ATCGCCGCTT CCCCGCCGGC CAGTTCATCC GCGGGCCGCA CGAACTGGAC TACCTGCAGG AACCCGACAT CTTCCACGAC GTCTTCGGCC ACGTGCCGAT GCTGACCGAT CCGGTGTTCG CCGACTACAT GCAGGCCTAC GGCCAGGGCG GCCAGCGGGC GCTCGGCCTG GGGCGGCTGG CCAACCTGGC GAGGCTCTAC TGGTATACGG TCGAGTTCGG CCTGATGGAG ACGAAGGCAG GCCTGCGGAT TTATGGGGCC GGGATCGTGT CGTCGCGCGC CGAATCGCTG TTCGCCCTCG AAGACCCCTC TCCCAACCGC ATCGGCTTCG ACCTGGAGCG CGTGATGCGC ACGCCCTATC GGATCGACGA TTTCCAGCAG GTCTATTTCG TCATCCCCTC GATCCAGACC CTGCAGGAAG TCACCCTGCG CGACTTCGGG CCGCTCTATG ACCGCCTGGC CGGGGCCAGC GACCTCGGCA TCGCCGAGAT CGCTGGCCCC GATCGGGTGA TCACCGTCGG CAATCAGGCC TACGCGAAGG CCGGAGGGCG GCTGGCTGTG GCCGCCGACT AA
|
Protein sequence | MSADGFSSGP PPGATADWTI DQGWEGYSQA EHDLWITLYE RQTAMLPERA CDEFLRGLDA LDLHRSGIPD FKRINEELQR LTGWSVVAVP GLVPDDVFFD HLANRRFPAG QFIRGPHELD YLQEPDIFHD VFGHVPMLTD PVFADYMQAY GQGGQRALGL GRLANLARLY WYTVEFGLME TKAGLRIYGA GIVSSRAESL FALEDPSPNR IGFDLERVMR TPYRIDDFQQ VYFVIPSIQT LQEVTLRDFG PLYDRLAGAS DLGIAEIAGP DRVITVGNQA YAKAGGRLAV AAD
|
| |