Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mmar10_0336 |
Symbol | |
ID | 4284996 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Maricaulis maris MCS10 |
Kingdom | Bacteria |
Replicon accession | NC_008347 |
Strand | - |
Start bp | 396801 |
End bp | 397880 |
Gene Length | 1080 bp |
Protein Length | 359 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 638139799 |
Product | 4-hydroxyphenylpyruvate dioxygenase |
Protein accession | YP_755567 |
Protein GI | 114568887 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins |
TIGRFAM ID | [TIGR01263] 4-hydroxyphenylpyruvate dioxygenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.401058 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 0.00969902 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCCGACC TGTTTGAAAA CCCGATGGGA CTTTGCGGTT TCGAGTTTGT TGAGTTCACT GCGCCGGAAC GTGGCGTGAT CGAGCCGGTC TTCCAGGCCA TGGGCTTCAC CCATATCGCC AATCACCGCT CCAAGGATGT CGAGCTGTGG CGCCAGGGCG GGATCAATTT CCTGATCAAT TACGAGCCAG ACACTCAAGC CGCCTTCTTC GCCAAGGAAC ATGGGCCCTC GGCCTGCGGC ATGGGCTTCC GGGTCAAGGA CGCGGCCCAG GCCTATGACG AATGCCTCGA GCGCGGCGCC GAGCCGGTCA TGACCGAGCC GGGCATCTCC GAGCTGGTCA TCCCGGCGGT CAAGGGCATT GGTGGTGCCT CGGTCTACCT GATCGACCGT TTCGAGGACG GCAAGTCGAT CTATGATATC GACTTCAACT ATCTAGACGG CGTCGACATC CACCCCGAAG GCTGCGGCTT CAACGTGATC GATCACCTGA CGCACAATGT CTACAAGGGC CGGATGGACT ATTGGGCCAA GTATTACGAG GACCTCTTCA ACTTCCGCGA AATCCGCTAT TTCGACATCA AGGGCGAATA TACCGGCCTG GTCTCCCGCG CCATGACGGC ACCGGACGGC CTGATCCGCA TCCCGCTGAA TGAAGAGAAA TCCGACGCGG TCGGCCAGAT CGAGGAATAT CTGCGCGAGT ACAAGGGCGA AGGCATCCAG CACATCGCCT TCTCCTGCGA CAATCTCATT GAATGCTGGG ACCGCCTGAA AAAGGCCGGC ACCGAGTTCA TGACCGCGCC GCCGGAAACC TATTATGCAA TGCTGGAAGA TCGTCTGCCG GGCCATGGCG AACCGACCGA AGAGTTCAAG AAGCGCGGCA TCCTGCTCGA CGGCACCACC GAAGGCGGCC AGCCTCGCCT GCTGCTGCAG ATCTTCTCCG GCAAGGCGAT CGGTCCGATC TTCTTCGAGT TCATCCAGCG CAAGGAAGAC GAAGGTTTCG GCGAGGGCAA TTTCAAGGCC CTGTTCGAAT CCATCGAACG CGACCAGATC GAACGCGGCG TCATCAACGC AGCCGAGTAG
|
Protein sequence | MADLFENPMG LCGFEFVEFT APERGVIEPV FQAMGFTHIA NHRSKDVELW RQGGINFLIN YEPDTQAAFF AKEHGPSACG MGFRVKDAAQ AYDECLERGA EPVMTEPGIS ELVIPAVKGI GGASVYLIDR FEDGKSIYDI DFNYLDGVDI HPEGCGFNVI DHLTHNVYKG RMDYWAKYYE DLFNFREIRY FDIKGEYTGL VSRAMTAPDG LIRIPLNEEK SDAVGQIEEY LREYKGEGIQ HIAFSCDNLI ECWDRLKKAG TEFMTAPPET YYAMLEDRLP GHGEPTEEFK KRGILLDGTT EGGQPRLLLQ IFSGKAIGPI FFEFIQRKED EGFGEGNFKA LFESIERDQI ERGVINAAE
|
| |