Gene Smal_3739 details


Gene Information

Locus tag: Smal_3739
Symbol:
ID: 6474621
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Stenotrophomonas maltophilia R551-3
Kingdom: Bacteria
Replicon accession: NC_011071
Strand:
Start bp: 4206429
End bp: 4207499
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 62%
IMG OID: 642732940
Product: 4-hydroxyphenylpyruvate dioxygenase
Protein accession: YP_002030121
Protein GI: 194367511
COG category: [E] Amino acid transport and metabolism; [R] General function prediction only
COG ID: [COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins
TIGRFAM ID: [TIGR01263] 4-hydroxyphenylpyruvate dioxygenase
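
The coordinate and length fields in this record agree with each other if Start bp and End bp are read as 1-based inclusive positions. That convention is an assumption; the record does not state it. A minimal sketch of the check, using hypothetical helper names:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(4206429, 4207499)
print(length, protein_length_aa(length))
```

With the values listed above this reproduces the stated Gene Length (1071 bp) and Protein Length (356 aa).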


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.368699
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGGTCA CCACCTTCGA GAACCCGATG GGCATCGACG GCTTCGAGTT CGTCGAATTC 
GCCGCCCCGG CCGGCCGTGG CCAGGAGCTG CACGAGTACT TCCGGAAGAT GGGCTTCAGC
GCGGTGCTCA AGCACAAGCA GCGTCCGATT ACCGTCTATC GCCAGGGCGA CGTCAACTTC
CTGGTCAATG AAGACCCGGA TTCGTTCGCT TCGGACTTCG CCGAAAAGCA CGGCCCGTGC
GCCTGCGGCT TCGCCATCCG CTTCAAGAAG CCGGGCCAGG AGGTCTACCA GACCGCGCTG
GGCAACGGCG CCGAAGCCAT CGCCTTCAAG CCGGACAGCA AGGCGGTCAG CGCGCCGGTC
ATCAAGGGCA TCGGCGACTG CATGCTGTAC CTGGTCGACC GCTACGGCAG CGCCGGCAGC
ATCTTCGATG GCGACTACGA GCTGATCGCC GGCGCCGAAC TGCGCCCGAA GGGCTTCGGC
TTGACCTTCA TCGACCACCT GACCCACAAC CTGTACTTCG GCAACATGCA GCAGTGGTCG
GACTACTACG AGCGCCTGTT CAACTTCCGC GAGATCCGCT ACTTCGACAT CAAGGGCCTG
AAGACCGGCC TGGTGTCCAA GGCGATGACC GCGCCGGACG GCATCGTGCG CATTCCGCTG
AATGAATCGT CCGACCCGAA GAGCCAGATC AACGAGTACC TGGATGCGTA CAAGGGCGAA
GGCATCCAGC ACATCGCCTG CTTCACCGAG AACATCTACG AGACCGTCGA AGCGATGCGT
GCGCAGGGCG TGGACTTCCT CGACACTCCG GAGACCTACT TCGACGTGAT CGACCAGCGC
GTGCCGAACC ACGGTGAAGA CGTGGCGCGC CTGGCCAAGA ACAAGATCCT GATCGACGCT
GATCCGGAAA CCCACCAGCG CAAGCTGCTG CAGATCTTCA CCCAGAACTG CATCGGCCCG
ATCTTCTTCG AGATCATCCA GCGCAAGGGC AACGAAGGCT TTGGCGAAGG CAACTTCACC
GCGCTGTTCG AAAGCATCGA GCGCGACCAG ATCCGCCGCG GCGTGCTGTA A
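
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before any calculation such as the GC content field. The sketch below shows one way to do that. How the source database computes its GC content value is not stated, so the plain (G + C) / length definition used here is an assumption, and the function names are hypothetical.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First printed line of the gene sequence, used only as a short example input.
first_line = "ATGCAGGTCA CCACCTTCGA GAACCCGATG GGCATCGACG GCTTCGAGTT CGTCGAATTC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 3))
```

Passing the full gene sequence through the same two functions gives a value that can be compared with the GC content field.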
 
Protein sequence
MQVTTFENPM GIDGFEFVEF AAPAGRGQEL HEYFRKMGFS AVLKHKQRPI TVYRQGDVNF 
LVNEDPDSFA SDFAEKHGPC ACGFAIRFKK PGQEVYQTAL GNGAEAIAFK PDSKAVSAPV
IKGIGDCMLY LVDRYGSAGS IFDGDYELIA GAELRPKGFG LTFIDHLTHN LYFGNMQQWS
DYYERLFNFR EIRYFDIKGL KTGLVSKAMT APDGIVRIPL NESSDPKSQI NEYLDAYKGE
GIQHIACFTE NIYETVEAMR AQGVDFLDTP ETYFDVIDQR VPNHGEDVAR LAKNKILIDA
DPETHQRKLL QIFTQNCIGP IFFEIIQRKG NEGFGEGNFT ALFESIERDQ IRRGVL
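
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table from the compact NCBI layout (bases in T, C, A, G order). Translation table 11 assigns the same amino acids to codons as the standard code and differs only in its permitted start codons, so the standard assignments are used here. The function name and the stop-at-first-stop-codon behaviour are illustrative choices, not part of the record.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... GGG ("*" marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is ignored; a codon with a non-ACGT base raises KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGCAGGTCA CCACCTTCGA GAACCCGATG"))
```

The first ten codons translate to MQVTTFENPM, which matches the start of the protein sequence above.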