Gene Information |
Locus tag | Sare_4680 |
Symbol | |
ID | 5704307 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 5302284 |
End bp | 5303489 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641274078 |
Product | 4-hydroxyphenylpyruvate dioxygenase |
Protein accession | YP_001539424 |
Protein GI | 159040171 |
COG category | [E] Amino acid transport and metabolism; [R] General function prediction only |
COG ID | [COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins |
TIGRFAM ID | [TIGR01263] 4-hydroxyphenylpyruvate dioxygenase |
|
|
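The coordinate and length fields in the Gene Information table are related by simple arithmetic. The sketch below shows that relationship. It assumes that Start bp and End bp are 1-based inclusive coordinates and that Protein Length excludes the terminal stop codon; the function names are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information table.
start_bp = 5302284
end_bp = 5303489


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by a complete CDS, assuming one terminal stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1206 401
```

Under these assumptions the computed values agree with the Gene Length (1206 bp) and Protein Length (401 aa) fields.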
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.211865 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.133112 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCAAG CGATCGACCG ACCCCAGTCG AGCGACGAGG TCGACGCCGA CCTGCTGGTC GGCGCCGTAG ACCACGACAT CAGCCGGGAT CCGTTCCCGG TCAGGGGCCT CGACCACGTG CACTTCCTGG TGGGCAACGC CAAACAGGCC GCGCACTACT ACTCCACCGC GTTCGGCATG ACGTGCGTGG CCTACCGGGG GCCGGAACAG GGCCACCGAG ACCACGCCCA GTACGTCCTG ACCAGTGGCT CGGCCCGGTT CGTTCTCACC GGCACGGTCC GCCCCGACGC GGCGGGTGCC GAGCAGGTCG CCCGGCACAG CGACGGCGTC TCCGACATCG CACTGGAGGT CCCGGACGTC GACGCGGCGT ACGCGCACGC CATCGCCCAG GGCGCGAGCG GTCTGGCGGA GCCGTACGAC GTCAGCGACG AACACGGCAC CGTCCGGCTG GCAGCCATCG CGACGTACGG CGACACCCGC CACACCCTGG TCGACCGCTC CCGTTACCGC GGTCCGTTCC TGCCCGGCTA CGTCGCCCGG CAGCCGATCG TCGATCGTCA GCCGATGGTC AACGCAGGTC TCCAGCCCAA GCGCTTCTTC CAGGCGATCG ACCACATCGT CGGCAACGTC GAGCTGGGCC GCATGGACGA GTGGGTCGAG TTCTACCGGC GTGTGATGGG CTTCACCAAC ATGGCGGAGT TCGTCGGCGA CGACATCGCC ACCGACTATT CGGCGCTGAT GAGCAAGGTG GTCGCCAACG GCACCCGGAA GGTGAAGTTC CCGCTCAACG AGCCGGCGGT CGCCCGGAAG AAGTCGCAGA TCGACGAGTA CCTGGAGTTC TATCAGGGCC CGGGAGCCCA GCACATCGCG GTGGCCACCA ACGACATTCT GGCCAGCGTG GACGCGATGC GCGCGGCCGG GGTCGAGTTC CTGGACACCC CGGACTCGTA CTACGACGAC CCGGAACTAC GTGCCCGGAT CGGTGAGGTG CGGGTGCCGA TCGAGGAGCT GAAGGCCCGC GGGATCCTGG TTGACCGGGA CGAGGACGGC TACCTGCTCC AGATCTTCAC CAAGCCGGTG CAGGACCGCC CAACCGTCTT CTTCGAGCTG ATCGAGCGAC ACGGCTCACT CGGCTTCGGC AAGGGCAACT TCAAGGCACT CTTCGAGGCC ATCGAACGGG AACAGGAGAA GCGCGGCAAC CTGTGA
|
Protein sequence | MTQAIDRPQS SDEVDADLLV GAVDHDISRD PFPVRGLDHV HFLVGNAKQA AHYYSTAFGM TCVAYRGPEQ GHRDHAQYVL TSGSARFVLT GTVRPDAAGA EQVARHSDGV SDIALEVPDV DAAYAHAIAQ GASGLAEPYD VSDEHGTVRL AAIATYGDTR HTLVDRSRYR GPFLPGYVAR QPIVDRQPMV NAGLQPKRFF QAIDHIVGNV ELGRMDEWVE FYRRVMGFTN MAEFVGDDIA TDYSALMSKV VANGTRKVKF PLNEPAVARK KSQIDEYLEF YQGPGAQHIA VATNDILASV DAMRAAGVEF LDTPDSYYDD PELRARIGEV RVPIEELKAR GILVDRDEDG YLLQIFTKPV QDRPTVFFEL IERHGSLGFG KGNFKALFEA IEREQEKRGN L
|
| |
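As a rough illustration of how the gene sequence maps to the protein sequence, the sketch below translates the first 30 and last 9 nucleotides of the gene sequence. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two differ in permitted start codons, which this sketch ignores). The helper name `translate` and the variable names are illustrative.

```python
from itertools import product

# Codon table in NCBI layout: bases ordered T, C, A, G, first base varying slowest.
# Assumption: table 11 shares these amino-acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 0; whitespace is ignored, '*' marks a stop codon."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First three 10-nt blocks and final 9 nt, copied from the Gene sequence field.
first_30_nt = "ATGACCCAAG CGATCGACCG ACCCCAGTCG"
last_9_nt = "AACCTGTGA"

print(translate(first_30_nt))  # MTQAIDRPQS
print(translate(last_9_nt))    # NL*
```

The first result matches the opening ten residues of the protein sequence, and the second shows the final two residues followed by the TGA stop codon.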