Gene Information |
Locus tag | Strop_4250 |
Symbol | |
ID | 5060735 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora tropica CNB-440 |
Kingdom | Bacteria |
Replicon accession | NC_009380 |
Strand | + |
Start bp | 4818443 |
End bp | 4819648 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640476512 |
Product | 4-hydroxyphenylpyruvate dioxygenase |
Protein accession | YP_001161056 |
Protein GI | 145596759 |
COG category | [E] Amino acid transport and metabolism; [R] General function prediction only |
COG ID | [COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins |
TIGRFAM ID | [TIGR01263] 4-hydroxyphenylpyruvate dioxygenase |
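The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that adds no residue. Both assumptions are consistent with the values in this record but are not stated by it. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 4818443
end_bp = 4819648
gene_length_bp = 1206
protein_length_aa = 401

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS of N bases holds N / 3 codons. Assumption: the last codon is the
# stop codon, which does not contribute an amino acid.
codons, remainder = divmod(span, 3)
residues = codons - 1

print(span, codons, residues)  # prints: 1206 402 401
```

The computed span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.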
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.365692 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.0823259 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCAGG CGATCGACCG ACCCCAGACG AGTGACGAGG TCGACGCCGA CCTGCTGGTC GGCGCCGTTG ACCACGACAT CAGCCACGAC CCGTTCCCGG TCAAGGGTCT CGACCACGTG CAGTTCCTGG TCGGCAACGC CAAGCAGGCC GCGCACTACT ACTCCACCGC CTTCGGCATG ACCTGCGTGG CCTACCGGGG GCCGGAGCAA GGCTACCGGG ATCACGCTCA GTACGTGCTG ACCAGTGGTT CGGCCCGCTT CGTCCTCACC GGCGCGGTCC GCCCGGACGC GGCCGGTGCC GAGCAGGTCG CCCGGCACAG CGACGGGGTC TGCGACATCG CGCTGGAGGT CCCCGACGTT GACGCGGCGC ACGCGCACGC CATCGCCCAG GGCGCGATCA GCCTTGCTGA GCCGTACGAG GTCAGCGACG AACACGGCAC GGTCCGGCTC GCCGCCATCG CCACGTATGG TGACACCCGC CACACCCTGG TGGACCGCTC CCGCTACCAC GGCCCGTTCC TACCCGGCTA CGTCGCCCGC CGACCGATCG TCGACCGCCA GCCAATGATC GACGCTGGCG TCCAGCCGAA GCGCTTCTTC CAGGCGATCG ACCACGTCGT CGGCAACGTC GAGCTGGGTC GCATGGACGA GTGGGTCGAG TTCTACCAGC GGGTGATGGG CTTCACCAAC ATGGCGGAGT TCGTCGGCGA CGACATCGCC ACCGACTACT CGGCGCTGAT GAGCAAGGTC GTCGCCAACG GCACCCGGAA GGTGAAGTTT CCGCTCAACG AGCCGGCGGT CGCCCGGAAG AAGTCGCAGA TCGACGAATA CCTGGACTTC TACCAGGGCC CCGGGGCCCA GCACATCGCG GTGGCCACCA ACGACATCCT GGCCAGCGTG GACGCGATGC GCGCGGCAGG CGTGGACTTC CTGGACACCC CCGACTCGTA CTACGACGAC CCGGAGCTGC GGGCCCGGAT CGGCGAGGTC CGGGTTCCGA TCGAGGAGCT GAAGGCCCGC CGGATCCTGG TCGACCGGGA CGAGGACGGC TACCTGCTCC AGATCTTCAC CAACCCGGTG CAGGACCGCC CGACCGTCTT CTTCGAGCTG ATCGAGCGAC ACGGCTCGCT CGGCTTCGGC AAAGGCAACT TCAAGGCGCT CTTCGAGGCC ATCGAGCGGG AGCAGGACAA GCGCGGCAAC CTGTGA
|
Protein sequence | MTQAIDRPQT SDEVDADLLV GAVDHDISHD PFPVKGLDHV QFLVGNAKQA AHYYSTAFGM TCVAYRGPEQ GYRDHAQYVL TSGSARFVLT GAVRPDAAGA EQVARHSDGV CDIALEVPDV DAAHAHAIAQ GAISLAEPYE VSDEHGTVRL AAIATYGDTR HTLVDRSRYH GPFLPGYVAR RPIVDRQPMI DAGVQPKRFF QAIDHVVGNV ELGRMDEWVE FYQRVMGFTN MAEFVGDDIA TDYSALMSKV VANGTRKVKF PLNEPAVARK KSQIDEYLDF YQGPGAQHIA VATNDILASV DAMRAAGVDF LDTPDSYYDD PELRARIGEV RVPIEELKAR RILVDRDEDG YLLQIFTNPV QDRPTVFFEL IERHGSLGFG KGNFKALFEA IEREQDKRGN L
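The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The sketch below is a minimal illustration, not the pipeline that produced this record. It builds a codon table in the conventional T, C, A, G ordering. The amino acid assignments of translation table 11 are the same as those of the standard code; table 11 differs in which codons may act as initiators, which does not matter here because the gene starts with ATG. The function name `translate` is hypothetical.

```python
import itertools

# Codon table in TCAG order. The amino acid assignments shown here are
# shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(itertools.product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace is ignored, so the 10-base groups used in this record can be
    pasted in directly. Alternative start codons are not handled.
    """
    cleaned = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(cleaned) - 2, 3):
        aa = CODON_TABLE[cleaned[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from this record.
print(translate("ATGACCCAGG CGATCGACCG ACCCCAGACG"))  # prints: MTQAIDRPQT
```

The result matches the first ten residues of the listed protein sequence. The last six bases of the gene, CTGTGA, translate to a single L followed by a stop codon, in line with the final residue of the protein.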