Gene Information |
Locus tag | Swit_3086 |
Symbol | |
ID | 5196984 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sphingomonas wittichii RW1 |
Kingdom | Bacteria |
Replicon accession | NC_009511 |
Strand | + |
Start bp | 3386506 |
End bp | 3387606 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640582635 |
Product | gentisate 1,2-dioxygenase-like protein |
Protein accession | YP_001263574 |
Protein GI | 148555992 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3435] Gentisate 1,2-dioxygenase |
TIGRFAM ID | |
|
|
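The coordinate and length fields above agree with each other. The sketch below checks this. It assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The helper names `span_length` and `protein_length` are hypothetical and are not part of any database tool.

```python
# Values copied from the Gene Information table; helper names are hypothetical.
START_BP = 3386506
END_BP = 3387606
GENE_LENGTH_BP = 1101
PROTEIN_LENGTH_AA = 366


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of an interval, ASSUMING 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count, ASSUMING an unspliced CDS that ends in one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values listed in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 3387606 - 3386506 + 1 gives 1101 bp, and 1101 / 3 - 1 gives 366 aa, which matches the table.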
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 0.407832 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGAAC CCGCCGCGAA TTACAGCTTT GTTCCCGCAC AGGCCGGCCA TGCGGTCCCG GACCGCTGGC AGCCGATCAA GGTTCCGCGC GCCGAGATCG CGGCCGAGAT CGATCGGCTC GCCGAGCGCG CCGTCGAGCC CGGCGCGCGC CGGAGCTCCG AGATCGTCCA TGCCAATGCG CTCGATCTGG GCAGGGGCAT CACGCCCGGC CTGTCCATCG CGATCCAGGT GCTGCGGCCC GGCGAGACGA TCATGATCGA TCGCGACAAT GCCAACCGGG TCGAGTTCTG CCTCGGCGGG GAAGGCGAAG CGCTGCTGGG GCAGCGAAGC TTCCGGGTCG AGAAATGGTC GGCATGGATG GTGCCGTCGA TGACGCCGCG CCTCTATCGC AACAGCGGCC AGCAGCTCCT CATCTGGCTG AGCTATTCCA ACCAGCCCCT GCTCGAACGG GCCGGCGTCT ATTATGCGGA CGCGGTGCAT GACCATCCGC AGGCCGCCCC GCGCAGGCCG GGCATCAACC AATATGCGCG CGACACCGCG CCGGACATCG CCATATCGGA CGATGGCGCG CGCTTGCGCG GCTACGAATT CCTCACCGAT ATCGAGGTGG TGGACAATCC GCCGCTGCTC TGGCCCTGGC GGGAGATGCT GCCGCATCTG TCGCGCCGGC AGGGCGACGA CAAGCGCACG ATCATGCTGA TGTACAACCC CGCGACCGGC CGCAGGAACG GGACGACCCA CAGCTTCTTC GCGACGATCA CGTCCTTTCC GGCCGGTCCG CTGCGCCCGC CGCCGCCGCG CGGCCATCGC CACAGCAGCT TCGCGACCAA CTATCATTTC GAAGGCGCCG GCGAGAGCGT GGTCGACGGA CAGAGATTCG AGTGGGAAGC GGGCGATCTG ATGCTTTCGG CGCCGAGCTG GGCGGAGCAT TCGCACGGCA TTTCCGAACG TGGCGCCAGC GTGCTGACCG TTCAGGACCA CCCCTTTCAG ATCGGCATGG GAGCGCTGAT CTGGCAGGAG GACATGGCCG GTCCGGTCCT GACGCTGGGC TCCGAACCCG GCCAGACCGG CTATGTCGGC CCGCGCCTGG CCGGCGAATA G
|
Protein sequence | MNEPAANYSF VPAQAGHAVP DRWQPIKVPR AEIAAEIDRL AERAVEPGAR RSSEIVHANA LDLGRGITPG LSIAIQVLRP GETIMIDRDN ANRVEFCLGG EGEALLGQRS FRVEKWSAWM VPSMTPRLYR NSGQQLLIWL SYSNQPLLER AGVYYADAVH DHPQAAPRRP GINQYARDTA PDIAISDDGA RLRGYEFLTD IEVVDNPPLL WPWREMLPHL SRRQGDDKRT IMLMYNPATG RRNGTTHSFF ATITSFPAGP LRPPPPRGHR HSSFATNYHF EGAGESVVDG QRFEWEAGDL MLSAPSWAEH SHGISERGAS VLTVQDHPFQ IGMGALIWQE DMAGPVLTLG SEPGQTGYVG PRLAGE
|
| |
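The gene and protein sequences above can be spot-checked against each other by translating the gene sequence codon by codon. The sketch below does this for the first and last few codons. It uses the standard codon assignments, which translation table 11 shares for elongation codons, since table 11 differs from the standard code only in the start codons it permits. The function name `translate` and the two fragment constants are hypothetical; the fragments are copied from the start and end of the Gene sequence field.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a forward-strand CDS; whitespace between blocks is ignored."""
    cleaned = "".join(dna.split()).upper()
    usable = len(cleaned) - len(cleaned) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[cleaned[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases and last 18 bases of the Gene sequence field, copied as shown.
HEAD_FRAGMENT = "ATGAACGAAC CCGCCGCGAA TTACAGCTTT"
TAIL_FRAGMENT = "CGCCTGG CCGGCGAATA G"

if __name__ == "__main__":
    print(translate(HEAD_FRAGMENT))  # MNEPAANYSF
    print(translate(TAIL_FRAGMENT))  # RLAGE*
```

The two results match the first ten residues (MNEPAANYSF) and the last five residues (RLAGE) of the Protein sequence field, followed by the TAG stop codon. Passing the full Gene sequence field to `translate` should give the whole protein in the same way, since the function strips the spaces between the 10-base blocks.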