Gene RSc1566 details


Gene Information

Locus tag: RSc1566
Symbol: aroC
ID: 1220397
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ralstonia solanacearum GMI1000
Kingdom: Bacteria
Replicon accession: NC_003295
Strand:
Start bp: 1681674
End bp: 1682774
Gene Length: 1101 bp
Protein Length: 366 aa
Translation table: 11
GC content: 69%
IMG OID: 637237951
Product: chorismate synthase
Protein accession: NP_519687
Protein GI: 17546285
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0082] Chorismate synthase
TIGRFAM ID: [TIGR00033] chorismate synthase
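
The coordinates and lengths listed above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1681674
END_BP = 1682774
GENE_LENGTH_BP = 1101
PROTEIN_LENGTH_AA = 366


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming the last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 1101
print(coding_codons(GENE_LENGTH_BP))   # 366
```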


Plasmid Coverage information

Num covering plasmid clones: 35
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTGGCA ATACCCTGGG CCTGCTGTTT TCCGTCACCA CCTTCGGCGA GTCGCACGGC 
CCGGCCATCG GTGCCGTCAT CGACGGCTGC CCGCCGGGCA TGGCGCTGTC GGCCGAGGAC
ATCCAGCCCG ATCTCGACCG CCGCAAGCCC GGCACCTCGC GCCACGTCAC GCAGCGCAAG
GAAGAAGACC TTGTCGAGAT CCTGTCCGGC GTGTTCGAGG GCAAGACCAC CGGCACGCCC
ATCTGCCTGC TGATCCGCAA CACCGACCAG CGCAGCAAGG ACTACGGCAA CATCGTCGAG
ACCTTCCGCC CGGGCCATGC CGACTACACC TACTGGCACA AGTACGGCAT CCGCGACCCG
CGCGGCGGCG GCCGTTCGTC GGCCCGGCTG ACGGCGCCCG TGGTGGCGGC CGGCGCCGTC
GCCAAGAAAT GGCTGCGCGA GAAGTTCGGC GTCGAGATCC ACGGCTACAT GTCGCAGCTG
GGCGAGATCC GGATTCCGTT CCTCGACTGG AACGAGGTGC CGAACAACCC GTTCTTCGCG
CCCAACGCCG AGATCCTCCC CGAGCTCGAA ACCTACATGG ACGCGCTGCG CCGCGACGGC
GACTCCGTCG GCGCGCGCAT CGAGGTGGTG GCGACCGGCA TGCCGGTCGG CTGGGGCGAG
CCGCTGTTCG ACCGCCTGGA CGCCGACATC GCCCATGCCA TGATGGGCCT GAATGCAGTG
AAGGGCGTGG AGATCGGCGC GGGCTTTCAT GCCGTGTCGC AGCGCGGCTC CGAGCACGGC
GACGAACTGA CGCCGGCCGG CTTCGTCGGC AACAACGCGG GCGGTATCCT GGGCGGCATT
TCCACCGGGC AGGACATCTC GGTATCGCTG GCGATCAAGC CGACCTCCAG CATCCGCACG
CCGCGCCGCT CGATCGACAA GGCGGGCGAG CCGACCGCGG TCGAGACGTT CGGCCGCCAC
GATCCGTGTG TCGGTATCCG CGCCACGCCG ATCGCCGAGG CGCTGCTGGC GCTGGTGCTG
ACCGACCATG CGCTGCGCCA TCGTGCCCAA TGCGGCGACG TGGCGGTGGC GACCCCGGCC
ATCGCCGCCA AGGCGCCGTA A
 
Protein sequence
MSGNTLGLLF SVTTFGESHG PAIGAVIDGC PPGMALSAED IQPDLDRRKP GTSRHVTQRK 
EEDLVEILSG VFEGKTTGTP ICLLIRNTDQ RSKDYGNIVE TFRPGHADYT YWHKYGIRDP
RGGGRSSARL TAPVVAAGAV AKKWLREKFG VEIHGYMSQL GEIRIPFLDW NEVPNNPFFA
PNAEILPELE TYMDALRRDG DSVGARIEVV ATGMPVGWGE PLFDRLDADI AHAMMGLNAV
KGVEIGAGFH AVSQRGSEHG DELTPAGFVG NNAGGILGGI STGQDISVSL AIKPTSSIRT
PRRSIDKAGE PTAVETFGRH DPCVGIRATP IAEALLALVL TDHALRHRAQ CGDVAVATPA
IAAKAP
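
The two sequences above are printed in space-separated blocks of ten residues. The following is a minimal sketch of how such text could be cleaned, translated, and summarised by GC fraction. It assumes the input contains only A, C, G and T, and it uses the amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in its permitted start codons, which this sketch ignores). The helper names `clean`, `gc_fraction` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI order: first, second and third base each cycle T, C, A, G.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq_text: str) -> str:
    """Drop the spaces and line breaks that group residues into blocks."""
    return "".join(seq_text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last lines of the gene sequence in this record (both in frame).
first_blocks = "ATGTCTGGCA ATACCCTGGG CCTGCTGTTT"
last_blocks = "ATCGCCGCCA AGGCGCCGTA A"
print(translate(clean(first_blocks)))  # MSGNTLGLLF
print(translate(clean(last_blocks)))   # IAAKAP
```

Applied to the full gene sequence, the same helpers could be compared against the listed protein sequence and GC content.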