Gene RSc1044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRSc1044 
Symbol 
ID1219853 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRalstonia solanacearum GMI1000 
KingdomBacteria 
Replicon accessionNC_003295 
Strand
Start bp1098911 
End bp1100047 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content68% 
IMG OID637237407 
Productputative protease transmembrane protein 
Protein accessionNP_519165 
Protein GI17545763 
COG category[O] Posttranslational modification, protein turnover, chaperones
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0616] Periplasmic serine proteases (ClpP class) 
TIGRFAM ID[TIGR00706] signal peptide peptidase SppA, 36K type 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.163907 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGATC CGACCCCGCC GAAATCGCCG GAAGAAGGCG CCGGCAAGCC CGATGAACTG 
GAATTCACGC ACCAGGCGGA CCATCCGCTG GAGGCCGAGC TGCGAGATGC CGCTGCCGGC
AAGCCGGCGT CCAGGCCCGG TCTGTTCGGC CGCTTCCGAC ATGGCGAGAG CGGCGCGCCG
CGCGCATCGG GCGCCCCCGC CGGCTGGGAG CGCGACGTAC TCGAGCGCGT CCTCCTGGCG
GCGATCCGCG AGCAGCGCGC CGCCCGCCGC TGGCGCATCT TTTTCCGCTT CGTGACGCTG
GGCATCATCG GCGGGCTGCT GTATCTGTTC GCCAGCTTCG AGGGCGAGAC CGTCAGCTCC
GGCCGCCACA CCGCGCTGGT GACGCTCGAT GGCGAGATCG CCGCCAACAC CAACGCCAGC
GCCGACAACA TCAACGCCTC GCTGGAAGCC GCGTTCGCCG ACGACAACAC CGCCGGGGTG
ATCCTCAAGA TCAACTCGCC GGGCGGCTCG CCGGTGCAGG CCGGCATGAT CAACGACGAC
ATCCGCCGCC TGCGCGCCAA GTACAAGAAC ATCCCGCTGT ATGTGGTGGT CGAGGAGATG
TGCGCCTCGG GCGGCTACTA CGTGGCCGCC GCCGCCGACA AGATCTATGT CGACAAGGCC
AGCATCGTCG GCTCGATCGG CGTGCTGATG GACGGCTTCG GCTTCACCGG CCTGATGGAC
AAGCTGGGTG TGGAGCGGCG TCTGCTGACG GCCGGTACCA ACAAGGGCAT GCTCGACCCG
TTCTCGCCGG TGGCCCCGCA GCAGCGGCAA TTCGCCCAGG CGATGCTCGA CGAGGTGCAC
CAGCAGTTCA TCGATGTGGT CAAGCAGGGG CGCGGCAGCC GCCTGAAGGA CGATCCGCAG
CTGTTCTCCG GCCTGTTCTG GACCGGTGCC AAGGCGGTCG ATCTGGGCTT GGCGGACGGC
ATCGGCGGCA CCGATTTCGT CGCCCGCAAC ATCATCAAGG CGCCGGACTT GGTCGACTAC
ACGGTCAAGG AGAACTTCGC CGAGCGCGTG GCACGCAAGT TCGGCACGGC CATGGGCGCA
GGGGCCATCA AGGCGCTGGC CGCGACCGGC CAGCTCAAGC TCCTGATGAG GCAGTAG
 
Protein sequence
MTDPTPPKSP EEGAGKPDEL EFTHQADHPL EAELRDAAAG KPASRPGLFG RFRHGESGAP 
RASGAPAGWE RDVLERVLLA AIREQRAARR WRIFFRFVTL GIIGGLLYLF ASFEGETVSS
GRHTALVTLD GEIAANTNAS ADNINASLEA AFADDNTAGV ILKINSPGGS PVQAGMINDD
IRRLRAKYKN IPLYVVVEEM CASGGYYVAA AADKIYVDKA SIVGSIGVLM DGFGFTGLMD
KLGVERRLLT AGTNKGMLDP FSPVAPQQRQ FAQAMLDEVH QQFIDVVKQG RGSRLKDDPQ
LFSGLFWTGA KAVDLGLADG IGGTDFVARN IIKAPDLVDY TVKENFAERV ARKFGTAMGA
GAIKALAATG QLKLLMRQ