Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Swit_3864 |
Symbol | |
ID | 5199365 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sphingomonas wittichii RW1 |
Kingdom | Bacteria |
Replicon accession | NC_009511 |
Strand | - |
Start bp | 4254090 |
End bp | 4255418 |
Gene Length | 1329 bp |
Protein Length | 442 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640583419 |
Product | homogentisate 1,2-dioxygenase |
Protein accession | YP_001264347 |
Protein GI | 148556765 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3508] Homogentisate 1,2-dioxygenase |
TIGRFAM ID | [TIGR01015] homogentisate 1,2-dioxygenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 0.412305 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGACG TCGACCCCGC TCGCGGCTCC ACCGCCGCAA GGCCGGAGGA CATGATGACC GGTTCCTACA TTCCCGGCTT CGGCAACCAT GTCGCGACCG AGGCGGTTGC CGGCGCGCTG CCGGTGGGGC GCAACTCGCC GCAGCGCGTG CCCTACGGCC TCTATGCCGA GCAGCTCTCG GGCAGCGCCT TCACCGCGCC CCGCGCCGAG AACAAGCGGA GCTGGCTCTA CCGGATGCGT CCGGCCGCCA ACCATCCGCC GTTCAAGGCC TATACCCGGC CGACCGGCCT GCGCCATCGT CCGTTCGACG AGGCGCCGGC CGATCCCAAC CGGCTGCGCT GGGACCCGCT GCCGCTGCCG TCCGAGCCGA CCGACTTCAT CGACGGGCTG GTCAGCATCG CCGGCAATGG CGGGATCGGC GTCCATCTCT ATGCCGCCAA CCGCTCGATG GAGCGCCGCG TCTTCTTCAA TGCCGACGGC GAGCTGCTGA TCGTTCCCGA ACAGGGCGGC CTGCGGATCG CCACCGAGCT CGGCCTGGTC GAGGTCGAGC CGCTCCACAT CGCGGTGATC CCGCGCGGCG TCCGCTTCCG GGTCGAGCTG ACCGGCGACG CGGCGCGCGG CTATGTCTGC GAGAATTACG GCGCGCCCTT CCGCCTGCCC GACCTCGGCC CGATCGGATC GAACGGCCTC GCCAATCCGC GCGATTTCGA GACGCCGGCC GCCTGGTTCG AGGATATCGA CGAGCCGACC GAGCTGGTCC AGAAGTTCGA AGGCGCGCTG TGGTCGACGA CGATCGACCA TTCGCCGCTC GACGTGGTCG CCTGGCACGG CAATCTGGCA CCCTATCGCT ACGACCTGCG CCGCTTCAAC ACGATCGGCA CGGTCAGCTA CGATCATCCC GATCCGTCGA TCTTCACGGT GCTGACCTCG CCGTCGGAAG TGGCGGGCAC CGCCAATTGC GACTTCGTGA TCTTCCCCCC GCGCTGGATG GTCGCCGAGG ACACGTTCCG GCCGCCCTGG TTCCACCGCA ACGTGATGAG CGAATATATG GGCCTGATCA CCGGCGCCTA TGACGCCAAG GCGGGCGGCT TCATGCCCGG CGGCGGATCG CTCCACAACC GCATGTCGGG CCACGGCCCC GACCGCGCGA GCTACGAGAC GGCGATCGCC GCCGATCTCA AGCCGCACAA GATCGACGCC ACCATGGCCT TCATGTTCGA AAGCCGCTCG GTGCTCCGCC CGACCCGGTT CGCGCTCGAA AGCCCGCTCG CCCAGCTCGA TTATGACGAT TGCTGGTCGG GCTTCACCAA GGCAAAGCTT CCCAGATGA
|
Protein sequence | MADVDPARGS TAARPEDMMT GSYIPGFGNH VATEAVAGAL PVGRNSPQRV PYGLYAEQLS GSAFTAPRAE NKRSWLYRMR PAANHPPFKA YTRPTGLRHR PFDEAPADPN RLRWDPLPLP SEPTDFIDGL VSIAGNGGIG VHLYAANRSM ERRVFFNADG ELLIVPEQGG LRIATELGLV EVEPLHIAVI PRGVRFRVEL TGDAARGYVC ENYGAPFRLP DLGPIGSNGL ANPRDFETPA AWFEDIDEPT ELVQKFEGAL WSTTIDHSPL DVVAWHGNLA PYRYDLRRFN TIGTVSYDHP DPSIFTVLTS PSEVAGTANC DFVIFPPRWM VAEDTFRPPW FHRNVMSEYM GLITGAYDAK AGGFMPGGGS LHNRMSGHGP DRASYETAIA ADLKPHKIDA TMAFMFESRS VLRPTRFALE SPLAQLDYDD CWSGFTKAKL PR
|
| |