Gene RSc2224 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRSc2224 
Symbol 
ID1221069 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRalstonia solanacearum GMI1000 
KingdomBacteria 
Replicon accessionNC_003295 
Strand
Start bp2411998 
End bp2413095 
Gene Length1098 bp 
Protein Length365 aa 
Translation table11 
GC content64% 
IMG OID637238623 
Productdioxygenase alpha subunit 
Protein accessionNP_520345 
Protein GI17546943 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCAATC TCAGCACCGC GCTGAATCTG GTGCCGTCTG AAACCCAGCT GCCCGTCTCC 
GCATACTTCG ACGAGGCGCT GTACCAAACC GAAATCGAAC GTCTGTTCAA GCATGGCCCG
AGCTACGTCG GCCATGAGCT GATGGTGCCC GAGGTGGGCG ACTATCACAC GCTTGCCGCC
GAGGCCGAAG GCCGCGTGCT GGTACGCAAT CCGAACGGCG TCGAACTGCT ATCCAACGTA
TGCCGGCATC GCCAGGCGAT CATGCTCAAT GGGCGGGGCA ATGCCCAGAA CATCGTCTGC
CCGCTGCACC GCTGGACATA CGACCTGAAG GGCGAACTGC TGGGCGCGCC GCATTTCGAG
CGGCAGCCGT GCGTGCACCT GTCGCGCTCG CTGCTGCAGA ACTGGAACGG CCTGCTGTTC
GAGGGCAAGC GCGACGTGCG CAACGACCTC GCCCGCCTGG GCGTGGCGCG CGACCTCGAC
TTCTCCGGCT ACATGCTCGA CCACGTCGAG GTGCACGACT GCGACTACAA CTGGAAGACC
TTCATCGAGG TCTACCTGGA GGACTACCAC GTCGTGCCCT TCCACCCCGG CCTCGGCCAG
TTCGTCTCGT GCGACGACCT GACCTGGGAA TTCGGCGAGT GGTACAGCGT GCAGACGGTC
GGCATCCACG CCGGCCTGCG CAAGCCCGGC ACGGCGACCT ACCAGAAGTG GCATGACGCC
GTGCTGCGCT TCAACAACGG CGAGATGCCC AAGTACGGCG CGGTATGGCT GACGTACTAC
CCGAACGTGA TGGTGGAGTG GTACCCGAAC GTCCTGGTGG TCTCGACCCT GCATCCGATG
GGCCCGGGCA AGACCCGCAA CGTGGTCGAG TTCTATTACC CGGAAGAAAT CGTGCTGTTC
GAGCGCGAAT TCGTCGAGGC CGAGCGCGCC GCCTACATGG AGACCTGCAT CGAGGACGAC
GAGATCGCCG AGCGCATGGA TGCCGGCCGG CTGGCCCTGC TCAGGCGCGG CACCAGCGAG
GTCGGGCCTT ACCAGTCGCC GATGGAAGAC GGCATGCAGC ATTTCCACGA GTGGTACCGC
CGCGTGATGG ACTATTGA
 
Protein sequence
MSNLSTALNL VPSETQLPVS AYFDEALYQT EIERLFKHGP SYVGHELMVP EVGDYHTLAA 
EAEGRVLVRN PNGVELLSNV CRHRQAIMLN GRGNAQNIVC PLHRWTYDLK GELLGAPHFE
RQPCVHLSRS LLQNWNGLLF EGKRDVRNDL ARLGVARDLD FSGYMLDHVE VHDCDYNWKT
FIEVYLEDYH VVPFHPGLGQ FVSCDDLTWE FGEWYSVQTV GIHAGLRKPG TATYQKWHDA
VLRFNNGEMP KYGAVWLTYY PNVMVEWYPN VLVVSTLHPM GPGKTRNVVE FYYPEEIVLF
EREFVEAERA AYMETCIEDD EIAERMDAGR LALLRRGTSE VGPYQSPMED GMQHFHEWYR
RVMDY