Gene Mmar10_0336 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmar10_0336 
Symbol 
ID4284996 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMaricaulis maris MCS10 
KingdomBacteria 
Replicon accessionNC_008347 
Strand
Start bp396801 
End bp397880 
Gene Length1080 bp 
Protein Length359 aa 
Translation table11 
GC content59% 
IMG OID638139799 
Product4-hydroxyphenylpyruvate dioxygenase 
Protein accessionYP_755567 
Protein GI114568887 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins 
TIGRFAM ID[TIGR01263] 4-hydroxyphenylpyruvate dioxygenase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.401058 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.00969902 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCCGACC TGTTTGAAAA CCCGATGGGA CTTTGCGGTT TCGAGTTTGT TGAGTTCACT 
GCGCCGGAAC GTGGCGTGAT CGAGCCGGTC TTCCAGGCCA TGGGCTTCAC CCATATCGCC
AATCACCGCT CCAAGGATGT CGAGCTGTGG CGCCAGGGCG GGATCAATTT CCTGATCAAT
TACGAGCCAG ACACTCAAGC CGCCTTCTTC GCCAAGGAAC ATGGGCCCTC GGCCTGCGGC
ATGGGCTTCC GGGTCAAGGA CGCGGCCCAG GCCTATGACG AATGCCTCGA GCGCGGCGCC
GAGCCGGTCA TGACCGAGCC GGGCATCTCC GAGCTGGTCA TCCCGGCGGT CAAGGGCATT
GGTGGTGCCT CGGTCTACCT GATCGACCGT TTCGAGGACG GCAAGTCGAT CTATGATATC
GACTTCAACT ATCTAGACGG CGTCGACATC CACCCCGAAG GCTGCGGCTT CAACGTGATC
GATCACCTGA CGCACAATGT CTACAAGGGC CGGATGGACT ATTGGGCCAA GTATTACGAG
GACCTCTTCA ACTTCCGCGA AATCCGCTAT TTCGACATCA AGGGCGAATA TACCGGCCTG
GTCTCCCGCG CCATGACGGC ACCGGACGGC CTGATCCGCA TCCCGCTGAA TGAAGAGAAA
TCCGACGCGG TCGGCCAGAT CGAGGAATAT CTGCGCGAGT ACAAGGGCGA AGGCATCCAG
CACATCGCCT TCTCCTGCGA CAATCTCATT GAATGCTGGG ACCGCCTGAA AAAGGCCGGC
ACCGAGTTCA TGACCGCGCC GCCGGAAACC TATTATGCAA TGCTGGAAGA TCGTCTGCCG
GGCCATGGCG AACCGACCGA AGAGTTCAAG AAGCGCGGCA TCCTGCTCGA CGGCACCACC
GAAGGCGGCC AGCCTCGCCT GCTGCTGCAG ATCTTCTCCG GCAAGGCGAT CGGTCCGATC
TTCTTCGAGT TCATCCAGCG CAAGGAAGAC GAAGGTTTCG GCGAGGGCAA TTTCAAGGCC
CTGTTCGAAT CCATCGAACG CGACCAGATC GAACGCGGCG TCATCAACGC AGCCGAGTAG
 
Protein sequence
MADLFENPMG LCGFEFVEFT APERGVIEPV FQAMGFTHIA NHRSKDVELW RQGGINFLIN 
YEPDTQAAFF AKEHGPSACG MGFRVKDAAQ AYDECLERGA EPVMTEPGIS ELVIPAVKGI
GGASVYLIDR FEDGKSIYDI DFNYLDGVDI HPEGCGFNVI DHLTHNVYKG RMDYWAKYYE
DLFNFREIRY FDIKGEYTGL VSRAMTAPDG LIRIPLNEEK SDAVGQIEEY LREYKGEGIQ
HIAFSCDNLI ECWDRLKKAG TEFMTAPPET YYAMLEDRLP GHGEPTEEFK KRGILLDGTT
EGGQPRLLLQ IFSGKAIGPI FFEFIQRKED EGFGEGNFKA LFESIERDQI ERGVINAAE