Gene Swit_3864 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSwit_3864 
Symbol 
ID5199365 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingomonas wittichii RW1 
KingdomBacteria 
Replicon accessionNC_009511 
Strand
Start bp4254090 
End bp4255418 
Gene Length1329 bp 
Protein Length442 aa 
Translation table11 
GC content68% 
IMG OID640583419 
Producthomogentisate 1,2-dioxygenase 
Protein accessionYP_001264347 
Protein GI148556765 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3508] Homogentisate 1,2-dioxygenase 
TIGRFAM ID[TIGR01015] homogentisate 1,2-dioxygenase 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.412305 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGACG TCGACCCCGC TCGCGGCTCC ACCGCCGCAA GGCCGGAGGA CATGATGACC 
GGTTCCTACA TTCCCGGCTT CGGCAACCAT GTCGCGACCG AGGCGGTTGC CGGCGCGCTG
CCGGTGGGGC GCAACTCGCC GCAGCGCGTG CCCTACGGCC TCTATGCCGA GCAGCTCTCG
GGCAGCGCCT TCACCGCGCC CCGCGCCGAG AACAAGCGGA GCTGGCTCTA CCGGATGCGT
CCGGCCGCCA ACCATCCGCC GTTCAAGGCC TATACCCGGC CGACCGGCCT GCGCCATCGT
CCGTTCGACG AGGCGCCGGC CGATCCCAAC CGGCTGCGCT GGGACCCGCT GCCGCTGCCG
TCCGAGCCGA CCGACTTCAT CGACGGGCTG GTCAGCATCG CCGGCAATGG CGGGATCGGC
GTCCATCTCT ATGCCGCCAA CCGCTCGATG GAGCGCCGCG TCTTCTTCAA TGCCGACGGC
GAGCTGCTGA TCGTTCCCGA ACAGGGCGGC CTGCGGATCG CCACCGAGCT CGGCCTGGTC
GAGGTCGAGC CGCTCCACAT CGCGGTGATC CCGCGCGGCG TCCGCTTCCG GGTCGAGCTG
ACCGGCGACG CGGCGCGCGG CTATGTCTGC GAGAATTACG GCGCGCCCTT CCGCCTGCCC
GACCTCGGCC CGATCGGATC GAACGGCCTC GCCAATCCGC GCGATTTCGA GACGCCGGCC
GCCTGGTTCG AGGATATCGA CGAGCCGACC GAGCTGGTCC AGAAGTTCGA AGGCGCGCTG
TGGTCGACGA CGATCGACCA TTCGCCGCTC GACGTGGTCG CCTGGCACGG CAATCTGGCA
CCCTATCGCT ACGACCTGCG CCGCTTCAAC ACGATCGGCA CGGTCAGCTA CGATCATCCC
GATCCGTCGA TCTTCACGGT GCTGACCTCG CCGTCGGAAG TGGCGGGCAC CGCCAATTGC
GACTTCGTGA TCTTCCCCCC GCGCTGGATG GTCGCCGAGG ACACGTTCCG GCCGCCCTGG
TTCCACCGCA ACGTGATGAG CGAATATATG GGCCTGATCA CCGGCGCCTA TGACGCCAAG
GCGGGCGGCT TCATGCCCGG CGGCGGATCG CTCCACAACC GCATGTCGGG CCACGGCCCC
GACCGCGCGA GCTACGAGAC GGCGATCGCC GCCGATCTCA AGCCGCACAA GATCGACGCC
ACCATGGCCT TCATGTTCGA AAGCCGCTCG GTGCTCCGCC CGACCCGGTT CGCGCTCGAA
AGCCCGCTCG CCCAGCTCGA TTATGACGAT TGCTGGTCGG GCTTCACCAA GGCAAAGCTT
CCCAGATGA
 
Protein sequence
MADVDPARGS TAARPEDMMT GSYIPGFGNH VATEAVAGAL PVGRNSPQRV PYGLYAEQLS 
GSAFTAPRAE NKRSWLYRMR PAANHPPFKA YTRPTGLRHR PFDEAPADPN RLRWDPLPLP
SEPTDFIDGL VSIAGNGGIG VHLYAANRSM ERRVFFNADG ELLIVPEQGG LRIATELGLV
EVEPLHIAVI PRGVRFRVEL TGDAARGYVC ENYGAPFRLP DLGPIGSNGL ANPRDFETPA
AWFEDIDEPT ELVQKFEGAL WSTTIDHSPL DVVAWHGNLA PYRYDLRRFN TIGTVSYDHP
DPSIFTVLTS PSEVAGTANC DFVIFPPRWM VAEDTFRPPW FHRNVMSEYM GLITGAYDAK
AGGFMPGGGS LHNRMSGHGP DRASYETAIA ADLKPHKIDA TMAFMFESRS VLRPTRFALE
SPLAQLDYDD CWSGFTKAKL PR