Gene RoseRS_3989 details

Gene Information

Locus tag: RoseRS_3989
Symbol: 
ID: 5210972
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus sp. RS-1
Kingdom: Bacteria
Replicon accession: NC_009523
Strand: 
Start bp: 4990126
End bp: 4991415
Gene Length: 1290 bp
Protein Length: 429 aa
Translation table: 11
GC content: 62%
IMG OID: 640597580
Product: peptidase S8/S53 subtilisin kexin sedolisin
Protein accession: YP_001278286
Protein GI: 148658081
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1404] Subtilisin-like serine proteases
TIGRFAM ID: 
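
The gene and protein lengths in the table follow from the start and end coordinates. The sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive and that the unspliced CDS includes its stop codon. The record does not state either convention, and the function names are hypothetical.

```python
# Sketch only: function names are hypothetical, and the coordinate
# conventions (1-based, inclusive; stop codon counted) are assumptions.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(4990126, 4991415)
print(length)                  # 1290
print(protein_length(length))  # 429
```

Both values agree with the Gene Length and Protein Length fields above.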


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGTATTAC GCCGAAGCAG GCATTCGTCA GTCGTGGAAA AATTTATTGC CGTCCGCAAG 
CGAACACGAA CCGCAGCATT TGCGCTGCAC AATGTCGCAA CTGCGGAAAA GCGCTCGGAG
ATCATGCGCG ATGCGCAGAC ATCGCGCCGC ATGATGGAGT TGATCCGTCT CGGATTAAGG
CCCCACCTGC CGGGTCAGCG GAAACCGGTC GGTCATGGGT TCGAGCGGCT GGGACACGTG
AGTTGTTACA TCGTCAACGC GCCAACCGAT CGACAGGCTG AACAGGCGCG CGAGATACTC
GCAGACGACT ACCTGATCGT TCCCGATGTG CCGCTCTCAC TGCCGGTTGC GCGCACCGGC
GCCGAAACCA GGTTTCGCCG TCCGCGCGCA CCGGAGTGGC CCGACGTGAG CGGGATCCAG
GAAGCGCATC GCAGAGGGAT TACCGGCGAA CAGGTGATTG TCGGCATACT CGATACCGGT
TGTGACGCCG ACCACAACGA GTTTGCCGGG AAACGGATCG AGTTCCGGTA TGTGCCATTC
GTGCCAACGC CCGAAAGTAT GCGCGCAGTC AACGGGTTCG ACACCCATGG TCACGGCACC
CACGTGTGCG GCATTATCGC CGGGCGCAAC GTCGGCGTCG CGCCGGGTGT CGAACTGCTG
GCAGCGGCGG TGATCGAAAG CGAGACGGTC AAAACCAGTC TGGAACGGAT CGTCGTCGCG
CTGGACTGGA TGCTGTCCCA TTTCAGTCTG GCGGAAAACC AGCACAAACC GATGATCATC
AGCATGTCGC TCGGTTTCCG CCCGGAATGG ATCAGCGCAC CGGCGTTCAA AACGGTGACC
GATGGGATGC GACTGTTGCT GCGCACGCTG GTGGAGGATT TCGACGTGCT GCCGATCGTC
GCCATTGGCA ACGACGGTCC CGGCGTCATT CGCGCACCCG GATCGTATGC CGAAGCGCTG
GGGGTCGGCG CAGTCGATTT CGATCTCAAC CCCTGGCCCG GCAGCAGCAG CGGGCAGACG
CCCGACGGAC GCCACAAGCC GGATATTGTC GGTTTTGGCG TCAACATTCT GTCGAGCCTG
GAACGAGATC TTCAACGTCG CAGTCTATAC GCCAGGATGA GCGGCACGAG CATGGCAGCG
CCATACGTGG CGGGCATCGC AGCGCTGATT GCTTCGGCAA ATCCCGGATT GCAGGGAGCG
GCGCTGCGTC AGCGATTGCT GGAAACGGCG CTGCCGTTAT CGGCGCCCGC CGAACGGGTC
GGCGCCGGGC TGGCGAGGTT TGTTGTATGA
 
Protein sequence
MVLRRSRHSS VVEKFIAVRK RTRTAAFALH NVATAEKRSE IMRDAQTSRR MMELIRLGLR 
PHLPGQRKPV GHGFERLGHV SCYIVNAPTD RQAEQAREIL ADDYLIVPDV PLSLPVARTG
AETRFRRPRA PEWPDVSGIQ EAHRRGITGE QVIVGILDTG CDADHNEFAG KRIEFRYVPF
VPTPESMRAV NGFDTHGHGT HVCGIIAGRN VGVAPGVELL AAAVIESETV KTSLERIVVA
LDWMLSHFSL AENQHKPMII SMSLGFRPEW ISAPAFKTVT DGMRLLLRTL VEDFDVLPIV
AIGNDGPGVI RAPGSYAEAL GVGAVDFDLN PWPGSSSGQT PDGRHKPDIV GFGVNILSSL
ERDLQRRSLY ARMSGTSMAA PYVAGIAALI ASANPGLQGA ALRQRLLETA LPLSAPAERV
GAGLARFVV
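
The gene and protein sequences above are related through translation table 11. The following is a minimal sketch of that relationship, not the method the database used. The helper names are hypothetical. The codon assignments and start-codon set are those NCBI lists for table 11, and reading the first codon as Met whenever it is in the start set is a simplifying assumption.

```python
# Sketch only: helper names are hypothetical, not part of the source record.
BASES = "TCAG"
# Amino-acid assignments for NCBI translation table 11, in TCAG codon order.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Start codons listed for table 11; an initiating codon is read as Met.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        if pos == 0 and codon in START_CODONS:
            residue = "M"  # alternative start codons such as GTG initiate with Met
        protein.append(residue)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
first_blocks = "GTGGTATTAC GCCGAAGCAG GCATTCGTCA"
print(translate(first_blocks))  # MVLRRSRHSS
```

The output matches the first ten residues of the protein sequence. The gene begins with GTG rather than ATG, which is why the start-codon handling is needed to obtain the leading M. Passing the full gene sequence to `gc_fraction` would give the value to compare against the GC content field.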