Gene RSp1020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRSp1020 
SymbolepsA 
ID1223332 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRalstonia solanacearum GMI1000 
KingdomBacteria 
Replicon accessionNC_003296 
Strand
Start bp1286203 
End bp1287348 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content66% 
IMG OID637240885 
ProductEPS I polysaccharide export outer membrane transmembrane protein 
Protein accessionNP_522581 
Protein GI17549241 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1596] Periplasmic protein involved in polysaccharide export 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.784638 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.873856 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCGTAA GCATTCCGAA TATCCGAAAA GCAGTCGTGT CGTTGAGCGT GGTGCCGTTG 
CTGGCCGCAT GCGCATTTGC CCCGGGCATG CGGTTCGATC CGCAGCGCCC GCTGGATCCG
GCCGACAACG CGTCGGTACC GAAGATCACG CCCATTACGC CCGATCTTGT GCGGGCCGGG
CAGACGCAGG CACAGGTGCA GGCGTCGCAC GAGAATGCCG ATGTCGGGCC GTTGCTGGCA
AAGGCAACGC CGTATCGCAT CGGCACGGGC GACATCCTGT CGATCGTGGT CTGGGATCAC
CCCGAACTGG TGTTCCCGAC GCAGACCTAT TCGATCGGGT CCACATACGA TCTTGCCAGC
TTTGGCGGGG CACCCAGCGT GTCCGGCTAT GTGGTCAGCA CCGGTGGCGA CATCCAGTTC
CCCTATGCCG GCGTCATCAA GGTCGCAGGC AAGACCCAGA ACGAAGTCCG CGACGAGATC
TCGCGTGGCA TTGCCCGGGT GGTGAAGGAC CCGCAGGTCA CGGTGCGGGT GCTGGCCTAC
CGCAGCCAGC GGGTCTACGT GGATGGTGAG GTCAAGACCC CCGGCCAGCA GAGCATCGAC
GACGTGCCGA TGACCCTGGT CGAGGCGCTG AACCGCGCCG GCGGCATCAA CACCACCACC
GGGGACAACA GCCGGATCCG GCTGACCCGC GGCGGCAAGC AATGGACGCT GAGCATGCCC
GCGCTGATGC AGCAGGGCAT CGACCCGGCC AACATTCTGC TGCGCGGCGG CGACATCGTC
CGCGTGGAGC AGCGCGAGGA CAGCAAGGTC TTCGTGACCG GCGAAGTGGT CAGACCGTCG
ACCGTGCTGC CGCGCAACGG CAGGCTGACG CTGAGCGAAG CGCTGGGCGA GGCCGGGGGC
GTCAGCCCGG TGTCGTCCGA TCCGCGCAAT GTCTACGTGA TCCGCCGGGC CGCGGAGGGC
GAGCCCCAGG TCTACCACCT GGATGCCAAG TCGCCCGTGG CGCTGGCGCT GGCCGAAGGC
TTCGAGCTGA AACCGAAGGA CGTGGTGTAC GTGGATGCCG GCAGCCTGGT GCGCTGGAGC
CGTGTGATCA ACCTCTTGGT GCCGACCGCA ACCCCGCTGA TCGGGGCGGC CGCTGTCGCG
AAATGA
 
Protein sequence
MFVSIPNIRK AVVSLSVVPL LAACAFAPGM RFDPQRPLDP ADNASVPKIT PITPDLVRAG 
QTQAQVQASH ENADVGPLLA KATPYRIGTG DILSIVVWDH PELVFPTQTY SIGSTYDLAS
FGGAPSVSGY VVSTGGDIQF PYAGVIKVAG KTQNEVRDEI SRGIARVVKD PQVTVRVLAY
RSQRVYVDGE VKTPGQQSID DVPMTLVEAL NRAGGINTTT GDNSRIRLTR GGKQWTLSMP
ALMQQGIDPA NILLRGGDIV RVEQREDSKV FVTGEVVRPS TVLPRNGRLT LSEALGEAGG
VSPVSSDPRN VYVIRRAAEG EPQVYHLDAK SPVALALAEG FELKPKDVVY VDAGSLVRWS
RVINLLVPTA TPLIGAAAVA K