Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smal_3738 |
Symbol | |
ID | 6474620 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Stenotrophomonas maltophilia R551-3 |
Kingdom | Bacteria |
Replicon accession | NC_011071 |
Strand | - |
Start bp | 4205028 |
End bp | 4206326 |
Gene Length | 1299 bp |
Protein Length | 432 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 642732939 |
Product | homogentisate 1,2-dioxygenase |
Protein accession | YP_002030120 |
Protein GI | 194367510 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3508] Homogentisate 1,2-dioxygenase |
TIGRFAM ID | [TIGR01015] homogentisate 1,2-dioxygenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.333429 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCCCCG CCATCACCGC CCGCGGCTAC CAGTCCGGCT TCGGCAACGA ATTCGCCACC GAGGCCGTCG CCGGCGCGCT GCCGGTCGGG CAGAACTCGC CGCAGAAGGT GGCCCACGGC CTGTACGCCG AGCAGTTGAC CGGCACCGCG TTCACCGCGC CGCGTGGCAG CAATCGCCGC AGCTGGCTGT ACCGGATCCG CCCGGCGGTA ACCCATGGTG AGTTCACCCC GTTCGCGCAG TCGCAGCTGC AGTGCGATTT CGCTGCGCAG CCGGCGTCGC CGAACCAGCT GCGCTGGAGC CCGTTGCCGC TGCCGGAGCT GCCGACCGAC TTTGTCGAAG GTCTGTATAC GATGGGTGGC AACGGCTCGC CGGATGCGCA TGCCGGTGTG GGTATCCACC TCTACGCCGC CAACCGCGAC ATGGTCGGCC GCTATTTCTA CGATGCCGAT GGCGAACTGC TGATCGTGCC GCAGCTGGGC GCGCTGCGCC TGTTGACCGA GCTGGGCGTG ATCGAGATCG AGCCGCAGCA GATCGCGGTG ATCCCGCGTG GCGTGCGGTT CCGCGTCGAA CTGCCCGATG GCCCGAGCCG CGGCTACATC TGCGAGAACT ACGGTGCGCT GCTGAAGCTG CCTGACCTCG GCCCGATCGG CTCCAATGGC CTGGCCAACC CGCGCGACTT CGAAACTCCG CACGCGGCGT TCGAGGATGT TGACGGTGAT TTCGAGCTGA TCGCCAAGTT CGAGGGCCGC CTGTGGCGCG CGCCGATCGA CCATTCGCCG CTGGACGTGG TGGCCTGGCA CGGCAACTAC GCGCCGTACC GCTACGACCT GCGCCGCTTC AACACCATCG GCTCGATCAG CCATGACCAT CCGGACCCGT CGATCTTCCT GGTGCTGCAC TCGCCCAGCG ACACGCCGGG GACCAGCAAC ATGGACTTCG CGATCTTCCC ACCGCGCTGG CTGGTAGCAC AGAACACCTT CCGTCCGCCG TGGTTCCACC GCAACATCGC CAGCGAGTTC ATGGGCCTGG TGCATGGCGC CTACGACGCC AAGGCCGAAG GCTTCGTGCC CGGCGGCGCC TCGCTGCACA ACTGCATGAG CGGCCACGGC CCGGATGCGC CGACCTTCGA CAAGGCCTCC AACGCGGACC TGTCCAAGCC GGACGTGATC AAGGACACGA TGGCCTTCAT GTTCGAGACC CGCGCGGTGA TCCGCCCGAC CGCGCAGGCC TTGGCTGCCG GCCATCGGCA GGGCGATTAC CAGCAGTGCT GGAACGGCCT GCGTAACAAC TACCGCTGA
|
Protein sequence | MSPAITARGY QSGFGNEFAT EAVAGALPVG QNSPQKVAHG LYAEQLTGTA FTAPRGSNRR SWLYRIRPAV THGEFTPFAQ SQLQCDFAAQ PASPNQLRWS PLPLPELPTD FVEGLYTMGG NGSPDAHAGV GIHLYAANRD MVGRYFYDAD GELLIVPQLG ALRLLTELGV IEIEPQQIAV IPRGVRFRVE LPDGPSRGYI CENYGALLKL PDLGPIGSNG LANPRDFETP HAAFEDVDGD FELIAKFEGR LWRAPIDHSP LDVVAWHGNY APYRYDLRRF NTIGSISHDH PDPSIFLVLH SPSDTPGTSN MDFAIFPPRW LVAQNTFRPP WFHRNIASEF MGLVHGAYDA KAEGFVPGGA SLHNCMSGHG PDAPTFDKAS NADLSKPDVI KDTMAFMFET RAVIRPTAQA LAAGHRQGDY QQCWNGLRNN YR
|
| |