Gene Smed_2836 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_2836 
Symbol 
ID5323706 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp2960806 
End bp2961918 
Gene Length1113 bp 
Protein Length370 aa 
Translation table11 
GC content59% 
IMG OID640791781 
Product4-hydroxyphenylpyruvate dioxygenase 
Protein accessionYP_001328501 
Protein GI150398034 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins 
TIGRFAM ID[TIGR01263] 4-hydroxyphenylpyruvate dioxygenase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000134458 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCCCAT TTCCACATGA CGCACCGCCG CCGGCGATCT CGGCGGACAA TCCCGCCGGA 
ACCGACGGGT TCGAGTTCGT CGAATTCGCC CATCCGGAAC CGGAGAAGCT TGCGGAACTC
TTCGGCCGCA TGGGCTACGC ACCGGTTGCC AGGCACAGGA CGAAGGACAT CACGATATGG
CGACAGGGCG ACATCAACTA TGTCCTGAAT GCTGAAGCCG GCTCGCATGC CATGCGCTTC
GTCGGAGAAC ACGGGCCCTG CGCCCCGTCG ATGGCCTGGC GCGTCGTCGA CGCGAAGCAT
GCCTTCGAGC ACGCCGTATC GAACGGCGCC GAGGCCTATA CCGGCAACGA CAAGAGCCTG
GACGTACCGG CGATCGTCGG CATCGGCGGC TCGCTTCTCT ATTTCGTGGA AGTTTACGGC
GAGAAAGGGT CCGCTTACGA TGCCGAGTTC GAATGGCTGC GCGAGCGTGA TCCGAAGCCG
GCCGGCGTCG GCTTCTATTA TCTCGACCAC CTGACCCACA ATGTCTATCG CGGCAATATG
GACAAGTGGT GGGCCTTCTA TCGCGAACTG TTCAATTTCA AACAGATCCA TTTCTTCGAC
ATCGACGGCC GCATCACCGG CCTCGTCAGC CGGGCGATCA CCTCACCTTG CGGCAAGATT
CGCATCCCAC TGAACGAATC GAAGGACGAC ACCAGCCAGA TCGAGGAATA TCTGACGAAG
TACAAAGGCG AAGGCATACA GCACATCGCG GTCGGTACCG AGGCGATCTA CGATGCGACC
GACAAACTCG CGGCAAACGG TCTGAAGTTC ATGCCGGGAC CGCCTGAAAC CTATTATGAG
ATGTCCCACC AGCGCGTTCG CGGACACGAC GAACCGATCG ACCGGATGAA GAAACATGGC
ATCCTGATCG ATGGAGAGGG TGTGGTGAAT GGCGGCATGA CGAAGATTCT GCTGCAGATC
TTCTCGCGCA CCGTGATCGG ACCAATCTTC TTCGAATTCA TTCAGCGCAA GGGTGACGAA
GGCTTCGGCG AGGGCAACTT CAGAGCATTG TTCGAATCGA TCGAGGCCGA CCAGATCCGC
CGCGGCGTAC TTGGCCACGA GGCGGCCGAG TAG
 
Protein sequence
MGPFPHDAPP PAISADNPAG TDGFEFVEFA HPEPEKLAEL FGRMGYAPVA RHRTKDITIW 
RQGDINYVLN AEAGSHAMRF VGEHGPCAPS MAWRVVDAKH AFEHAVSNGA EAYTGNDKSL
DVPAIVGIGG SLLYFVEVYG EKGSAYDAEF EWLRERDPKP AGVGFYYLDH LTHNVYRGNM
DKWWAFYREL FNFKQIHFFD IDGRITGLVS RAITSPCGKI RIPLNESKDD TSQIEEYLTK
YKGEGIQHIA VGTEAIYDAT DKLAANGLKF MPGPPETYYE MSHQRVRGHD EPIDRMKKHG
ILIDGEGVVN GGMTKILLQI FSRTVIGPIF FEFIQRKGDE GFGEGNFRAL FESIEADQIR
RGVLGHEAAE