Gene Smal_3738 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmal_3738 
Symbol 
ID6474620 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStenotrophomonas maltophilia R551-3 
KingdomBacteria 
Replicon accessionNC_011071 
Strand
Start bp4205028 
End bp4206326 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content67% 
IMG OID642732939 
Producthomogentisate 1,2-dioxygenase 
Protein accessionYP_002030120 
Protein GI194367510 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3508] Homogentisate 1,2-dioxygenase 
TIGRFAM ID[TIGR01015] homogentisate 1,2-dioxygenase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.333429 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCCCCG CCATCACCGC CCGCGGCTAC CAGTCCGGCT TCGGCAACGA ATTCGCCACC 
GAGGCCGTCG CCGGCGCGCT GCCGGTCGGG CAGAACTCGC CGCAGAAGGT GGCCCACGGC
CTGTACGCCG AGCAGTTGAC CGGCACCGCG TTCACCGCGC CGCGTGGCAG CAATCGCCGC
AGCTGGCTGT ACCGGATCCG CCCGGCGGTA ACCCATGGTG AGTTCACCCC GTTCGCGCAG
TCGCAGCTGC AGTGCGATTT CGCTGCGCAG CCGGCGTCGC CGAACCAGCT GCGCTGGAGC
CCGTTGCCGC TGCCGGAGCT GCCGACCGAC TTTGTCGAAG GTCTGTATAC GATGGGTGGC
AACGGCTCGC CGGATGCGCA TGCCGGTGTG GGTATCCACC TCTACGCCGC CAACCGCGAC
ATGGTCGGCC GCTATTTCTA CGATGCCGAT GGCGAACTGC TGATCGTGCC GCAGCTGGGC
GCGCTGCGCC TGTTGACCGA GCTGGGCGTG ATCGAGATCG AGCCGCAGCA GATCGCGGTG
ATCCCGCGTG GCGTGCGGTT CCGCGTCGAA CTGCCCGATG GCCCGAGCCG CGGCTACATC
TGCGAGAACT ACGGTGCGCT GCTGAAGCTG CCTGACCTCG GCCCGATCGG CTCCAATGGC
CTGGCCAACC CGCGCGACTT CGAAACTCCG CACGCGGCGT TCGAGGATGT TGACGGTGAT
TTCGAGCTGA TCGCCAAGTT CGAGGGCCGC CTGTGGCGCG CGCCGATCGA CCATTCGCCG
CTGGACGTGG TGGCCTGGCA CGGCAACTAC GCGCCGTACC GCTACGACCT GCGCCGCTTC
AACACCATCG GCTCGATCAG CCATGACCAT CCGGACCCGT CGATCTTCCT GGTGCTGCAC
TCGCCCAGCG ACACGCCGGG GACCAGCAAC ATGGACTTCG CGATCTTCCC ACCGCGCTGG
CTGGTAGCAC AGAACACCTT CCGTCCGCCG TGGTTCCACC GCAACATCGC CAGCGAGTTC
ATGGGCCTGG TGCATGGCGC CTACGACGCC AAGGCCGAAG GCTTCGTGCC CGGCGGCGCC
TCGCTGCACA ACTGCATGAG CGGCCACGGC CCGGATGCGC CGACCTTCGA CAAGGCCTCC
AACGCGGACC TGTCCAAGCC GGACGTGATC AAGGACACGA TGGCCTTCAT GTTCGAGACC
CGCGCGGTGA TCCGCCCGAC CGCGCAGGCC TTGGCTGCCG GCCATCGGCA GGGCGATTAC
CAGCAGTGCT GGAACGGCCT GCGTAACAAC TACCGCTGA
 
Protein sequence
MSPAITARGY QSGFGNEFAT EAVAGALPVG QNSPQKVAHG LYAEQLTGTA FTAPRGSNRR 
SWLYRIRPAV THGEFTPFAQ SQLQCDFAAQ PASPNQLRWS PLPLPELPTD FVEGLYTMGG
NGSPDAHAGV GIHLYAANRD MVGRYFYDAD GELLIVPQLG ALRLLTELGV IEIEPQQIAV
IPRGVRFRVE LPDGPSRGYI CENYGALLKL PDLGPIGSNG LANPRDFETP HAAFEDVDGD
FELIAKFEGR LWRAPIDHSP LDVVAWHGNY APYRYDLRRF NTIGSISHDH PDPSIFLVLH
SPSDTPGTSN MDFAIFPPRW LVAQNTFRPP WFHRNIASEF MGLVHGAYDA KAEGFVPGGA
SLHNCMSGHG PDAPTFDKAS NADLSKPDVI KDTMAFMFET RAVIRPTAQA LAAGHRQGDY
QQCWNGLRNN YR