Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RSc2224 |
Symbol | |
ID | 1221069 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ralstonia solanacearum GMI1000 |
Kingdom | Bacteria |
Replicon accession | NC_003295 |
Strand | + |
Start bp | 2411998 |
End bp | 2413095 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637238623 |
Product | dioxygenase alpha subunit |
Protein accession | NP_520345 |
Protein GI | 17546943 |
COG category | [P] Inorganic ion transport and metabolism [R] General function prediction only |
COG ID | [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCAATC TCAGCACCGC GCTGAATCTG GTGCCGTCTG AAACCCAGCT GCCCGTCTCC GCATACTTCG ACGAGGCGCT GTACCAAACC GAAATCGAAC GTCTGTTCAA GCATGGCCCG AGCTACGTCG GCCATGAGCT GATGGTGCCC GAGGTGGGCG ACTATCACAC GCTTGCCGCC GAGGCCGAAG GCCGCGTGCT GGTACGCAAT CCGAACGGCG TCGAACTGCT ATCCAACGTA TGCCGGCATC GCCAGGCGAT CATGCTCAAT GGGCGGGGCA ATGCCCAGAA CATCGTCTGC CCGCTGCACC GCTGGACATA CGACCTGAAG GGCGAACTGC TGGGCGCGCC GCATTTCGAG CGGCAGCCGT GCGTGCACCT GTCGCGCTCG CTGCTGCAGA ACTGGAACGG CCTGCTGTTC GAGGGCAAGC GCGACGTGCG CAACGACCTC GCCCGCCTGG GCGTGGCGCG CGACCTCGAC TTCTCCGGCT ACATGCTCGA CCACGTCGAG GTGCACGACT GCGACTACAA CTGGAAGACC TTCATCGAGG TCTACCTGGA GGACTACCAC GTCGTGCCCT TCCACCCCGG CCTCGGCCAG TTCGTCTCGT GCGACGACCT GACCTGGGAA TTCGGCGAGT GGTACAGCGT GCAGACGGTC GGCATCCACG CCGGCCTGCG CAAGCCCGGC ACGGCGACCT ACCAGAAGTG GCATGACGCC GTGCTGCGCT TCAACAACGG CGAGATGCCC AAGTACGGCG CGGTATGGCT GACGTACTAC CCGAACGTGA TGGTGGAGTG GTACCCGAAC GTCCTGGTGG TCTCGACCCT GCATCCGATG GGCCCGGGCA AGACCCGCAA CGTGGTCGAG TTCTATTACC CGGAAGAAAT CGTGCTGTTC GAGCGCGAAT TCGTCGAGGC CGAGCGCGCC GCCTACATGG AGACCTGCAT CGAGGACGAC GAGATCGCCG AGCGCATGGA TGCCGGCCGG CTGGCCCTGC TCAGGCGCGG CACCAGCGAG GTCGGGCCTT ACCAGTCGCC GATGGAAGAC GGCATGCAGC ATTTCCACGA GTGGTACCGC CGCGTGATGG ACTATTGA
|
Protein sequence | MSNLSTALNL VPSETQLPVS AYFDEALYQT EIERLFKHGP SYVGHELMVP EVGDYHTLAA EAEGRVLVRN PNGVELLSNV CRHRQAIMLN GRGNAQNIVC PLHRWTYDLK GELLGAPHFE RQPCVHLSRS LLQNWNGLLF EGKRDVRNDL ARLGVARDLD FSGYMLDHVE VHDCDYNWKT FIEVYLEDYH VVPFHPGLGQ FVSCDDLTWE FGEWYSVQTV GIHAGLRKPG TATYQKWHDA VLRFNNGEMP KYGAVWLTYY PNVMVEWYPN VLVVSTLHPM GPGKTRNVVE FYYPEEIVLF EREFVEAERA AYMETCIEDD EIAERMDAGR LALLRRGTSE VGPYQSPMED GMQHFHEWYR RVMDY
|
| |