Gene RSc3103 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRSc3103 
Symbol 
ID1221967 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRalstonia solanacearum GMI1000 
KingdomBacteria 
Replicon accessionNC_003295 
Strand
Start bp3341193 
End bp3342296 
Gene Length1104 bp 
Protein Length367 aa 
Translation table11 
GC content64% 
IMG OID637239521 
Product4-hydroxyphenylpyruvate dioxygenase oxidoreductase protein 
Protein accessionNP_521224 
Protein GI17547822 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins 
TIGRFAM ID[TIGR01263] 4-hydroxyphenylpyruvate dioxygenase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCATTCA CGCCTTGGGA AAACCCGATG GGCACCGCCG GCTTCGAGTT CATCGAATAC 
GCCGCGCCGG ACCCCGTCGC CATGGGCAAG CTGTTCGAGA AGATGGGCTT CAGCGCCATC
GCGAAGCACC GCCACAAGAA CGTGACGCTG TACCGCCAGG GCGGCATCAA CTTCATCATC
AACGCTGAAG CCGATTCGTT CGCGCAGCGC TTCGCGCGCC TGCACGGGCC GTCCATCTGC
GCCATCGCCT TCCGCGTGCA GGACGCGGCC CTCGCCTACC AGCGCGCGCT GGAACTGGGC
GCGTGGGGCT TCGACACCCA CAGCGGCCCG ATGGAGCTGA ACATCCCGGC GATCAAGGGC
ATCGGCGATT CGCTGATCTA CCTGGTGGAC CGCTGGACCG GCAAGAACGG CGCCAAAGAC
GTCGACATCG GCAACATCAG TATCTACGAC GTCGACTTCG TGCCCATTCC CGGCGCCAAC
CCGAACCCCA TCGGGCACGG CCTGACCTAC ATCGACCACC TGACGCACAA CGTCTACCGT
GGTCGCATGA AGGAGTGGGC CGAGTTCTAC GAACGCTTCT TCAACTTCCG CGAGATCCGC
TATTTCGATA TCGAGGGCCA GGTCACCGGC GTGAAGAGCA AGGCCATGAC GAGCCCGTGC
GGCAACATCC GCATTCCCAT CAACGAGGAA GGGACGGAGA AGGCCGGCCA GATCCAGGAA
TACCTGGACA TGTACCACGG CGAGGGCATC CAGCACATCG CGCTGGGCTC GACCGACCTG
CACCGGACGG TGGACGCGCT GCGCGGCAAC GGCATCAAGC TGCTGGACAC CATCGACACG
TACTACGAGC TGGTCGACAA GCGGATCCCC GGCCATGGCG AGAACGTGGC GGAGCTGAAG
AAGCGCAAGA TCCTGATCGA CGGCGCGCCG GGCGACCTGC TGCTGCAGAT CTTCTCGGAA
AACCAGCTGG GCCCGATCTT CTTCGAGTTC ATCCAGCGCA AGGGCAACCA GGGCTTCGGC
GAGGGCAACT TCAAGGCGCT GTTCGAGTCG ATCGAGCTCG ACCAGATGCG CCGCGGCGTG
CTGAAGGCGG ATCAGCCGGC CTGA
 
Protein sequence
MSFTPWENPM GTAGFEFIEY AAPDPVAMGK LFEKMGFSAI AKHRHKNVTL YRQGGINFII 
NAEADSFAQR FARLHGPSIC AIAFRVQDAA LAYQRALELG AWGFDTHSGP MELNIPAIKG
IGDSLIYLVD RWTGKNGAKD VDIGNISIYD VDFVPIPGAN PNPIGHGLTY IDHLTHNVYR
GRMKEWAEFY ERFFNFREIR YFDIEGQVTG VKSKAMTSPC GNIRIPINEE GTEKAGQIQE
YLDMYHGEGI QHIALGSTDL HRTVDALRGN GIKLLDTIDT YYELVDKRIP GHGENVAELK
KRKILIDGAP GDLLLQIFSE NQLGPIFFEF IQRKGNQGFG EGNFKALFES IELDQMRRGV
LKADQPA