Gene Information |
Locus tag | Svir_20700 |
Symbol | |
ID | 8387394 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Saccharomonospora viridis DSM 43017 |
Kingdom | Bacteria |
Replicon accession | NC_013159 |
Strand | - |
Start bp | 2203170 |
End bp | 2204288 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 644976128 |
Product | gentisate 1,2-dioxygenase |
Protein accession | YP_003133910 |
Protein GI | 257056078 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3435] Gentisate 1,2-dioxygenase |
TIGRFAM ID | [TIGR02272] gentisate 1,2-dioxygenase |
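The coordinate and length fields above are mutually consistent, and a short check makes the relationship explicit. This is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends in a stop codon that is not counted in the protein length; the helper names `gene_length` and `protein_length` are hypothetical and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(2203170, 2204288)
print(length, protein_length(length))  # prints "1119 372"
```

Both values agree with the Gene Length and Protein Length fields of the record.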
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.280371 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.141412 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGGACG AACTGGACCA GGCCGTCGAC CACGCCTCGG TCACCAACGA GATGACCCAG GGCGACAGCC CCGAGTTGAC CCAGCTGTAC CGGGACTTCG ACGCCCACCA TCTCATTCCG CTGTGGACGC AGATCGGCGA CCTGATGCCG ATGCACCCCA AGCCCGAGGC CGTGGCGCAC GTGTGGAAGT GGTCCACGTT GTATCCGCTG GCGCAGCGGG CGGGAGACCT CGTCCCGGTC GGACGGGGCG GCGAACGGCG GGCGATCGCG TTGGCCAACC CCGGCCTGCC CGGCTCCGCG TACGCCACCC CCACGCTGTG GGCGGCGATC CAGTACCTCG GTCCCAAGGA GACCGCACCC GAGCATCGGC ACTCGCAGAA CGCGTTCCGT TTCGTCCTCG AGGGCGAGGG GGTGTGGACG GTCGTCGACG GTGACCCCGT GCGGATGTCC CGTGGTGACT TCCTGCTGAC CCCCGGATGG CGCTTCCACG GCCACCACAA CGAGACGGAC CAGCCGATGG CGTGGCTCGA CGGGTTGGAC ATCCCGTTCT CCTACTACAC CGACGTCGGG TTCTTCGAAT TCGGCGCCGA CCGTGTCACC GACTACGCCA CGCCCAACTA CTCCCGTAGT GAACGACTGT GGTGCCACCC CGGGCTGCGC CCGCTGTCGG GGCTGCAGAA CACGGTGTCC TCGCCGATCG GGGCGTACCG GTGGGAGCAC ACCGACGCCG CGTTGACCGA ACAGCTGCTG TTGGAGGACG AGGGCCAGCC CGCCACCGTC GAACAGGGAC ACGCCGCCGT CCGCTACATC AACCCCACCA CCGGTGGCGA TGTCATGCCG ACGATCCGCG CCGAGTTCCA CCGGCTGCGG GCCGGGGCGG CGACACCCGC CCGGCGCGAG GTGGGCTCCA GCGTGTTCCA GGTGTTCGAA GGCCGCGGCC AGGTCGTGCT CGACGGTGTC GAGCATCCGC TGGAGAAGGG TGACCTGTTC GTGGTGCCCT CGTGGGTGCC GTGGTCGTTG CAGGCCGAGG AGCAGTTCGA CCTGTTCCGT TTCTCCGACG CCCCGATCAT GGAGCGTCTG CACTTCAACC GTGTTCACGT CGAAGGAGAG CAGCGATGA
|
Protein sequence | MSDELDQAVD HASVTNEMTQ GDSPELTQLY RDFDAHHLIP LWTQIGDLMP MHPKPEAVAH VWKWSTLYPL AQRAGDLVPV GRGGERRAIA LANPGLPGSA YATPTLWAAI QYLGPKETAP EHRHSQNAFR FVLEGEGVWT VVDGDPVRMS RGDFLLTPGW RFHGHHNETD QPMAWLDGLD IPFSYYTDVG FFEFGADRVT DYATPNYSRS ERLWCHPGLR PLSGLQNTVS SPIGAYRWEH TDAALTEQLL LEDEGQPATV EQGHAAVRYI NPTTGGDVMP TIRAEFHRLR AGAATPARRE VGSSVFQVFE GRGQVVLDGV EHPLEKGDLF VVPSWVPWSL QAEEQFDLFR FSDAPIMERL HFNRVHVEGE QR
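The gene and protein sequences above are printed in space-separated blocks of ten characters, so the blocks need to be joined before any analysis. The following is a minimal sketch, assuming the amino acid assignments of translation table 11 (which match the standard genetic code); the helpers `clean`, `gc_fraction`, and `translate` are hypothetical names, and only the first 30 bases of the gene sequence are used to keep the example short.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from the record.
fragment = clean("ATGTCGGACG AACTGGACCA GGCCGTCGAC")
print(translate(fragment))  # prints "MSDELDQAVD"
```

The translated fragment matches the first block of the protein sequence. Applying `gc_fraction` to the full cleaned gene sequence would give the value to compare against the GC content field.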