Gene Swit_3059 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSwit_3059 
Symbol 
ID5198609 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingomonas wittichii RW1 
KingdomBacteria 
Replicon accessionNC_009511 
Strand
Start bp3356580 
End bp3357629 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content69% 
IMG OID640582608 
Productgentisate 1,2-dioxygenase 
Protein accessionYP_001263547 
Protein GI148555965 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3435] Gentisate 1,2-dioxygenase 
TIGRFAM ID[TIGR02272] gentisate 1,2-dioxygenase 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0227445 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAGCAG AAACGGCCCC CGAAACCATG GCGGCCTTCT ACGCGGAGCT CGACGGCCAG 
AACATGGCGC CGCTGTGGGA AAGCCTGCAC AGTCTGGTGC CGCGCCAGCC GGCGCCGGTC
ATCCAGGCCG CCCATTGGGA CTATGACGCC GTCGTCCGCC CGCGCCTGAT GGAGGCGGGC
CGGCTGATCA CCGCGAAGAA GGCCGAGCGC CGCGTGCTCA TCCTCGAAAA TCCGGGGCTG
CGCGGCAAGG CCTCGATCAC CCAGTCGCTC TATGCCGGGT TGCAGCTCAT CCTGCCGGGC
GAGGTCGCGC CGGCGCATCG GCACACCCAA TGCGCGCTGC GTTTCATCGT CGAGGGGGAG
GGCGCGCACA CCACCGTGTC GGGCGAGCGT ACGATCATGC ATCCCGGCGA CTTCGTGCTG
ACGCCGAACT GGACCTGGCA CGACCATGGC AATGAGAGCG ACGCGCCGAT GGTGTGGCTC
GACGGGCTCG ACATTCCGAT CGTCGCCTTC CTCGACGCCG GTTTCGCCGA GGCCGGCAAT
GCCGACAGCC AGCCAACCGT TCGCCCCGAC GGCGACGCGG AGGCGCGGTT CGGGGGCACG
CTGCTGCCCG TGGACTGGCG GGCGTCGTCG CGGAACTCGC CGGTGCTCAA CTATCCCTAT
GCGCGGTCGC GGGAGACGCT GCACCGGCTG GAACGCAACG GCGAGGCCGA CGCCAGCCAC
GGATACAAGC TGCGCTACGT CAATCCGGCC GACGGCGGCT GGCCGATGCC GACGATCGGC
GCGTTCATCC AGTTCCTGCC GGGCGGTTTT CGCACCGCGC CCTACCGGTC GACCGACAGC
ACCGTCTATG CGGTGGTCGA GGGGCATGGC GAAAGCATCG TCGGCGATCG GCGCATCCGC
TGGAAACCGC GCGACATCTT CGTCGCGCCG AGCTGGCAAT GGCAGGAGCA TGCCGCGAGC
GGCGACGCGG TGCTGTTCAG TTTCTCCGAC CGTCCCGTTC AGGAGGGTCT CGGCCTGTGG
CGCGAAGAAC GGGGCATTCC CCGCCGCTGA
 
Protein sequence
MEAETAPETM AAFYAELDGQ NMAPLWESLH SLVPRQPAPV IQAAHWDYDA VVRPRLMEAG 
RLITAKKAER RVLILENPGL RGKASITQSL YAGLQLILPG EVAPAHRHTQ CALRFIVEGE
GAHTTVSGER TIMHPGDFVL TPNWTWHDHG NESDAPMVWL DGLDIPIVAF LDAGFAEAGN
ADSQPTVRPD GDAEARFGGT LLPVDWRASS RNSPVLNYPY ARSRETLHRL ERNGEADASH
GYKLRYVNPA DGGWPMPTIG AFIQFLPGGF RTAPYRSTDS TVYAVVEGHG ESIVGDRRIR
WKPRDIFVAP SWQWQEHAAS GDAVLFSFSD RPVQEGLGLW REERGIPRR