Gene Swit_3024 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSwit_3024 
Symbol 
ID5197150 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingomonas wittichii RW1 
KingdomBacteria 
Replicon accessionNC_009511 
Strand
Start bp3310111 
End bp3311196 
Gene Length1086 bp 
Protein Length361 aa 
Translation table11 
GC content65% 
IMG OID640582573 
Productvanillate monooxygenase 
Protein accessionYP_001263512 
Protein GI148555930 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTCCCA GGAATTGCTG GTATGTCGCT GCGATGAGCC ACGAGATCGC GCCGGGCGCG 
GTGATCTCGC GCATGATCTG CGACGTGCCG ATGGCGATCT TCCGCTCGCT CGGCGGGGTG
GCGAGCATCC TGCTCGACCG CTGCCCGCAC CGCCTCTACC CGCTGTCGGA AGGAACCGTC
GAAGCGCATG GCCTGCGCTG CGCCTATCAT GGCCTGGTGT TCGCCGGCGA CGGCGGCTGC
CGCGAGATTC CGAGCCAATC CGAAATTCCG GCCAACGCCT GTGTGACCGC CTATCCGGCG
CGCGAGAGCC ACGGCCTGAT CTGGGTCTGG CCGGGCGACG CGGCGCTGGC GGCCTCGACG
CCGTTGCCGG CGTTCGAAAC GGGGGAGGGT TATCTCGCGG GCCTCGATTT CAGCTGCCTC
GATCCGTCGA GCCTCTGGGG GGTGGCCGGG CCGCACGTCA TTCATCTCGA CGCGAACTAC
ATGCTCGCCG TCGACAATCT GCTCGACCTG ACCCACACCG CCTTCGTCCA CGCCAAGACG
TTCGACAATG CCGGCGTGCT GGACTCGACA CGGACGGTGA GGGCGACCGG CGATCAGCAA
CTGATCGACT TCTTCGCGTT CAAGAACAGC ATGTCGGCGC CGCTGCGCAA CGGCTATATC
CTCGACGAGG GCGTGCCGCT GTTCGACAAC TATCTCGAAA CCTACTGGCA GGCGCCAGGC
GTCATGATCC TGGTCCATGG CGCGGTGCCC GAAGGCGGCG ACCGCGAACG GGACGGGGCG
ATCGTGCTCA ACACCAATAT CCTGACCCCG GCGACCGAGA GGAGCTGCCA CTATTTCTGG
GCGCAGTCGG TCTATCGCGA TCGCGGGAAC GGCAAGGTGC GCGACCTGTG GGAGATCATG
ACCAAGGCCG CCTTCGCCGA GGACGAACAT ACGCTGCAGC GACAGCAGGC CAATCTGGAA
CGGTTCGGCG CCAGCGCGCT CGCCGACGAT GTCTCGCTGA TCCTCAAGGC CGACAAGGCG
ATCGTTCTGG CGCGTCGCAT GGTTAGCCGT ATGGTGCGGG GGGAAGAAGC TGCTAAAGCG
GCCTGA
 
Protein sequence
MFPRNCWYVA AMSHEIAPGA VISRMICDVP MAIFRSLGGV ASILLDRCPH RLYPLSEGTV 
EAHGLRCAYH GLVFAGDGGC REIPSQSEIP ANACVTAYPA RESHGLIWVW PGDAALAAST
PLPAFETGEG YLAGLDFSCL DPSSLWGVAG PHVIHLDANY MLAVDNLLDL THTAFVHAKT
FDNAGVLDST RTVRATGDQQ LIDFFAFKNS MSAPLRNGYI LDEGVPLFDN YLETYWQAPG
VMILVHGAVP EGGDRERDGA IVLNTNILTP ATERSCHYFW AQSVYRDRGN GKVRDLWEIM
TKAAFAEDEH TLQRQQANLE RFGASALADD VSLILKADKA IVLARRMVSR MVRGEEAAKA
A