Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_2673 |
Symbol | |
ID | 5706984 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 3048261 |
End bp | 3049370 |
Gene Length | 1110 bp |
Protein Length | 369 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641272131 |
Product | 4-hydroxyphenylpyruvate dioxygenase |
Protein accession | YP_001537501 |
Protein GI | 159038248 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins |
TIGRFAM ID | [TIGR01263] 4-hydroxyphenylpyruvate dioxygenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.000192911 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGATATCC GTGGAATAGA CCACATCGAA CTCTACGTGG GTGACGCCCG GCAGGCCGCC TTCTACTTCG GCAACGCGGT GGGAATGCGG CTGTGCGGCC AGGGTGGCCC GGAGACCGGA CTGACCGGGC AGCGTTCGTT GCTGCTGCGG CACGCTGGTG TCCGGTTGTT GCTGACCTCG GGGCTGACCG CCGACCATTC GGCGGCGGCG TACGTGCGAC GGCACGGCGA CGGTATCGCC GTGGTCGCGA TGGAGGTCGA CGACGCCGCC GGGGCGTACG CCGAACTGTT GGCCAGGGGT GCGACCGGCG GGACACCCCC GACCACCGTC ACCAGCGCCG ACGCCGAGGT CGTCGTTGCC GAGGTGGACG GTTTCGCCGA TGTGCGGCAC CGGCTGGTCG AGCGTCGCCG GGGCGGACCC GACTTCCTGC CGGGCCTGGC GGAGCTGCCG CCGGTGGACG ACACCGCCGA GAACCTGCTC GCCGAGATCG ACCACCTGGC GGTGTGTGTA CCGCCCGGGC AACTCGCCGA AACGGTCCGT GGCTACCGGG AGGTGTTCGG ATTCGCCGAG ATCTTCCACG AGTACGTGGA GGTCGACGGT CAGGCGATGA ACTCCACTGT GGTGCAGAGC CGGTCCGGGC GGGTGACGTT GGTGCTGCTC GAACCAGACA CCACGCGGCG GGCCGGGCAG ATCGACGCGT TCCTCACCCA GCACGCCGGT GCGGGGGTGC AGCACCTCGG GCTGCGCACC GACGACATCG TCGAAGCGGT CACCGCGCTG CGCCAGCGCG GGGTGGGATT CGCGCGTACC CCGGCGGCCT ACTACGACGA TCTGGAGACC CGGGTCGGCC GGGTCGACGG CTCACTGGAC CGGCTGCGGG AACTCGGCGT GCTGGTTGAC CGGGACCACG ACGGTCAGTT GCTGCAGATC TTCACAGAGT CGATGCACGT GCGCCGCACC CTCTTCCTCG AGTTGATCGA GCGGCGCGGG GCGCGGACCT TCGGCAGCGG CAACATCAAG GCGCTCTACG AGGCCAAAGA ACGGGAACTG GCCGTGGCGG GGGCGCTCCC CGCCGTCAGT GCGGCCACCG GCCAGGAGGT GACGGCATGA
|
Protein sequence | MDIRGIDHIE LYVGDARQAA FYFGNAVGMR LCGQGGPETG LTGQRSLLLR HAGVRLLLTS GLTADHSAAA YVRRHGDGIA VVAMEVDDAA GAYAELLARG ATGGTPPTTV TSADAEVVVA EVDGFADVRH RLVERRRGGP DFLPGLAELP PVDDTAENLL AEIDHLAVCV PPGQLAETVR GYREVFGFAE IFHEYVEVDG QAMNSTVVQS RSGRVTLVLL EPDTTRRAGQ IDAFLTQHAG AGVQHLGLRT DDIVEAVTAL RQRGVGFART PAAYYDDLET RVGRVDGSLD RLRELGVLVD RDHDGQLLQI FTESMHVRRT LFLELIERRG ARTFGSGNIK ALYEAKEREL AVAGALPAVS AATGQEVTA
|
| |