Gene Information |
Locus tag | Strop_2727 |
Symbol | |
ID | 5059190 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora tropica CNB-440 |
Kingdom | Bacteria |
Replicon accession | NC_009380 |
Strand | + |
Start bp | 3072970 |
End bp | 3074046 |
Gene Length | 1077 bp |
Protein Length | 358 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640474983 |
Product | 4-hydroxyphenylpyruvate dioxygenase |
Protein accession | YP_001159549 |
Protein GI | 145595252 |
COG category | [E] Amino acid transport and metabolism; [R] General function prediction only |
COG ID | [COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins |
TIGRFAM ID | [TIGR01263] 4-hydroxyphenylpyruvate dioxygenase |
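
The coordinate and length fields above are mutually consistent, which can be checked with a minimal sketch. It assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The helper names `gene_length_bp` and `protein_length_aa` are hypothetical and exist only for this illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length = gene_length_bp(3072970, 3074046)
print(length)                     # 1077
print(protein_length_aa(length))  # 358
```

Both results agree with the Gene Length (1077 bp) and Protein Length (358 aa) rows.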
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.000620471 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.914305 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCAACCG ATGACATAAC AGTCTTCGAC AACATGGTTC TCGACCACGT CAGCTTCTAC GCCGAGGACG CTTCAGCCGC CGCGAAATGG CTCGTTTCCG GGTACGGGTT CGCCGAGTAC GAGGACGATC GAGCCTTCAC CGGGTTCCCC GACGCCCGCT CGGTGGTCGT CGGCGTGAAT GATATCCGTT TCCGGATCAC ACAGCCTCTG GCTGACAGCC ATCCGGCCAA CCGCTACCTC ACCCGACACG GTGACGGAGT AGCCGACATT GCACTCCGGG TCCCGGACGC CACGGCTGCC TACCTGGCCG CGGTGCGACG TGGTGCGACA CCAGTGGCGG AGCCGACCGA GCGGGGCGGC CTGGTGACCG CGACCATCGG CGCTTTCGGC GACGTGACGC ACACTTTCGT GCAGGGCCGT GCTGGCATGA TCGGAGTGGT TCCGGCGGAA GGGCCGGGGA ACCGTCGCAG CACATCCGGT GCCGACGGTG AACTGGGCGA GATCGACCAC TTCGCGGTAT GCGTGTACGC CGGTGACCTG GACACGACCG TGTCATTCTA TCGGGATGTT CTGGACTTCG AACTGATCTT CGCTGAGCGG GTTCAGGTGG GGTCACAGGC AATGACCACC AAGGTGGTAC AAAGCCGTTC CGGTACCTTG ACCTTGACGT TGATCGAGCC AGACCTGAAC TGCGAGGCAG GCCATATCAA CGACTTCCTG ACCAATCACC GTGGACCAGG GGTACAGCAC ATCGCGTTCA CCGCGGAGTG TATCGTACAA GCTGTCGACG TGATCGGAGC ACGCGGGGTA GAGCTACTCT CCACGCCAGA TGCCTACTAT GCCGCACTAC CCGCCCGAAT GGACCTGGCC CGGTACACCG TTGACGAGCT GCGCAGTCGA AGCATTCTGG TGGACAGCGA CCATGACGGG CAGCTTTACC AGATTTTCAC CAAATCGGTG CACCCGCGGA ACACCATCTT CCTGGAGATC ATCGAGCGTC TCGGCGCTCG AGGCTTCGGC AGTGGCAACA TCAGGGCGTT GTATGAGGCG GTCGAGCGTA CGCGGGAGCA GGAATGA
|
Protein sequence | MSTDDITVFD NMVLDHVSFY AEDASAAAKW LVSGYGFAEY EDDRAFTGFP DARSVVVGVN DIRFRITQPL ADSHPANRYL TRHGDGVADI ALRVPDATAA YLAAVRRGAT PVAEPTERGG LVTATIGAFG DVTHTFVQGR AGMIGVVPAE GPGNRRSTSG ADGELGEIDH FAVCVYAGDL DTTVSFYRDV LDFELIFAER VQVGSQAMTT KVVQSRSGTL TLTLIEPDLN CEAGHINDFL TNHRGPGVQH IAFTAECIVQ AVDVIGARGV ELLSTPDAYY AALPARMDLA RYTVDELRSR SILVDSDHDG QLYQIFTKSV HPRNTIFLEI IERLGARGFG SGNIRALYEA VERTREQE
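
The gene sequence begins with GTG while the protein sequence begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG is an alternative initiation codon and is read as methionine when it starts the coding sequence. The following is a minimal sketch of that translation, not the pipeline used to produce this record. The function name `translate_table11` is hypothetical, and the codon table is built from the standard genetic code layout with bases ordered T, C, A, G.

```python
from itertools import product

# Standard codon layout: bases ordered T, C, A, G, third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_table11(dna: str) -> str:
    """Translate a plus-strand CDS; whitespace between blocks is ignored."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        # An initiation codon in the first position is read as methionine.
        if i == 0 and codon in START_CODONS:
            aa = "M"
        # Stop translating at the first stop codon.
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate_table11("GTGTCAACCG ATGACATAAC AGTCTTCGAC"))  # MSTDDITVFD
```

The output matches the first block of the protein sequence. Passing the full gene sequence, spaces included, should reproduce the whole protein, with translation ending at the terminal TGA.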