Gene Swit_2634 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSwit_2634 
Symbol 
ID5199970 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingomonas wittichii RW1 
KingdomBacteria 
Replicon accessionNC_009511 
Strand
Start bp2917282 
End bp2918634 
Gene Length1353 bp 
Protein Length450 aa 
Translation table11 
GC content68% 
IMG OID640582190 
Productbenzoate 1,2-dioxygenase, alpha subunit 
Protein accessionYP_001263130 
Protein GI148555548 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID[TIGR03229] benzoate 1,2-dioxygenase, large subunit 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGATGT CCATCCGCGA CCGCGTGGCG ACCGCCGTCG TCGACGATCC GGCCAGCGGG 
GCGTTCCGTT GCCGGCGGGA CATCTTCACC GATCCCGACA TCTTCGACCT CGAGATGAAG
CACATCTTCG AGGGCAATTG GGTCTATCTC GCCCATGAGA GCCAGGTCGC GGGCAAGAAC
GACTATTTCA CCACGTCGAT CGGCCGGGTC CCCGTCATCC TGACGCGGGG CAAGGACGGC
GCGATCAACG CCTTCGTCAA CGCCTGCTCG CATCGCGGCG CCCAGCTCTG CCGGCGCAAG
CGCGGCAACC AGCCGCTGCT CGTCTGTCCC TTCCACGGCT GGAGCTTCCG CACCGACGGG
TCGCTGCTCA AGGCGAAGGA CGCGGCGACC GGCGCCTATC CGGACAGCTT CGACCGCGAC
GGATCGCACG ACCTGACCCG CATCGCCCGC TTCGCCGACT ATCGCGGCTT CCTGTTCGGC
AGCCTCAACC CCGACGTCGC GCCGCTGGAG GACTATCTGG GCGAGACCCG GCTGATCATC
GACCAGATCG TCGACCAGGC GCCCGAGGGC CTGGAGATCC TGACCGGCAA TTCGACCTAC
ATCTTCGACG GCAACTGGAA GCTGCAGATG GAGAATGGCT GCGACGGCTA CCATGTCAGC
TCGGTCCACG CCAACTACGC CTCGACCATG GCGCGGCGCG CGGAGGGCGG CACCCGGGCG
GTCGACGCCA ATGGCTGGTC GAAGGCGGTG AGCGGCGTCT ACGGCTTCGA GAACGGCCAT
ATCCTGCTGT GGACCCGCGT GCTCAACCCG GAGGTCCGCC CGGTCTGGAC GCAGCGGGCG
GCGCTCGCGG AGCGGCTCGG CGCGGACCGG GCCGAGATCA TCGTCAGCCA GAGCCGCAAC
CTGGCGCTCT ATCCCAACGT CTTCCTGATG GACCAGTTCT CCACCCAGAT CCGGGTGGTG
CGCCCGATCG ACGTCCACCG GACCGAGGTG ACGATCTACT GCTTCGCGCC GAAGGGCGAG
AGCGCCGAGC TCCGCGCGAC CCGCATCCGG CAATATGAGG ATTTCTTCAA CGTCTCCGGC
ATGGGCACAC CCGACGACCT GGAGGAGTTC CGCTCCTGCC AGTCGGCCTA TGAAGGGGCG
GGCGCGCTGT GGAACGACCT CAGCCGCGGG GCGACGCGCT GGATCGCCGG GCCCGACGAC
AATGCCCGCG CGATGGGGAT GAATCCGCTG CTGTCGAGCG AGCGCAGCGA GGATGAAGGG
CTGTTCGTTC GCCAGCACGA ATATTGGGCG CGCGCGATGC TCGCCGGGAT CGACCGGGAG
AGCGATGCCG CCCTGCTGGA GGCGGCGGAA TGA
 
Protein sequence
MTMSIRDRVA TAVVDDPASG AFRCRRDIFT DPDIFDLEMK HIFEGNWVYL AHESQVAGKN 
DYFTTSIGRV PVILTRGKDG AINAFVNACS HRGAQLCRRK RGNQPLLVCP FHGWSFRTDG
SLLKAKDAAT GAYPDSFDRD GSHDLTRIAR FADYRGFLFG SLNPDVAPLE DYLGETRLII
DQIVDQAPEG LEILTGNSTY IFDGNWKLQM ENGCDGYHVS SVHANYASTM ARRAEGGTRA
VDANGWSKAV SGVYGFENGH ILLWTRVLNP EVRPVWTQRA ALAERLGADR AEIIVSQSRN
LALYPNVFLM DQFSTQIRVV RPIDVHRTEV TIYCFAPKGE SAELRATRIR QYEDFFNVSG
MGTPDDLEEF RSCQSAYEGA GALWNDLSRG ATRWIAGPDD NARAMGMNPL LSSERSEDEG
LFVRQHEYWA RAMLAGIDRE SDAALLEAAE