Gene Swit_1543 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSwit_1543 
Symbol 
ID5198875 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingomonas wittichii RW1 
KingdomBacteria 
Replicon accessionNC_009511 
Strand
Start bp1717567 
End bp1718667 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content64% 
IMG OID640581091 
Productvanillate monooxygenase 
Protein accessionYP_001262044 
Protein GI148554462 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.111008 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00152746 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
TTGAGCGAAG CCGAGGTTGC GAAGCTATCC GACCGTTTCG AAATATCCCG GCGCAGCGTC 
GCGAACGCGG AAAGCGGGTT CATCCTGAAC GCTTGGTATA TCGCCGGCCT GTCCAGCGAT
TTCGACCGGA AGCTCTCGAC GCGCACGATA TGCGGAAAGT CGATCCTGTT CTTCCGCGAC
GCGGACGGCG GCGTCCATGC CATGCAGAAC CGCTGCGGGC ACCGCTCCTA CCCTCTCTCC
GCCGGCACGC TCGACGGCAA TGAGATCGTC TGCGGCTATC ATGGCCTCAG ATACAATGCG
CAGGGCATCT GCGTGCAGCT TCCCGATGGC GGCTGCCCGC GCGGCAGGTT CGGGGTGCGG
AGCTATCCGC TGGTCGAGCG GGCCTCGTTC GTCTGGATCT GGGTCGGCGA TGCCGAACTG
GCCAAGTCCA CGCCCGTGCC GCATCCCGAA TGGCTGGGCG GGGAAGGCTG GTCCTTCGTC
ACCGGCTACA GCCATGCCGA AGGCAGCTAT GTCCATCTGC ACGAGAATCT GCTCGACCTG
TCGCACCTGA CCTATCTGCA TGCGGCGACG TTCGGCACGC CCGAGTTCGC GCTCGCCCCG
ATCGAGACCG AGATCAAGGA TGACGACATC CAGGTCTGGC GGAACGTCGA ATGCCAGTTG
CCGCCGATCT ATTCGATCCC GCTGGGATGG GAAGGCCAGC GCGCGCTGCG AAGGTCCGGG
TCACAGCTCG TTTCGCCCGG CCTCCATGTG AACACCGGCA TTTTCGAGAA TCTCGAACTG
GCCGAGCAGC CCGTGCCGAA GCCGATGGTG AAGGTGGCGC AGCTCATCAC GCCGGAAACC
CAGCATTCGA TTCATTATCA TTATGCGGTG GCCCGCAACT TCGCGCTCGA CGATGACGCC
GTCGGCGATC ATCTGCTCAA GGGATCGCAG GCGGCGTTCA GGGAGGACAT CGAGGCGCTG
CGCCGGATCA CGCAGATGCA CGCCGAGGCG GGCCCCGGGG GCGAAGTGTT CGAGTTCGAC
ATCCCGACCG ACAGGGCGGG GCTCGAGATG CGGCGCCGCT TCAAGAAGTT GATCGATCTC
GAGGAAGCAG CGGACGCCTA G
 
Protein sequence
MSEAEVAKLS DRFEISRRSV ANAESGFILN AWYIAGLSSD FDRKLSTRTI CGKSILFFRD 
ADGGVHAMQN RCGHRSYPLS AGTLDGNEIV CGYHGLRYNA QGICVQLPDG GCPRGRFGVR
SYPLVERASF VWIWVGDAEL AKSTPVPHPE WLGGEGWSFV TGYSHAEGSY VHLHENLLDL
SHLTYLHAAT FGTPEFALAP IETEIKDDDI QVWRNVECQL PPIYSIPLGW EGQRALRRSG
SQLVSPGLHV NTGIFENLEL AEQPVPKPMV KVAQLITPET QHSIHYHYAV ARNFALDDDA
VGDHLLKGSQ AAFREDIEAL RRITQMHAEA GPGGEVFEFD IPTDRAGLEM RRRFKKLIDL
EEAADA