Gene RoseRS_2829 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_2829 
Symbol 
ID5209798 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp3529865 
End bp3532864 
Gene Length3000 bp 
Protein Length999 aa 
Translation table11 
GC content62% 
IMG OID640596426 
Productpeptidase S8/S53 subtilisin kexin sedolisin 
Protein accessionYP_001277148 
Protein GI148656943 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1404] Subtilisin-like serine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0476517 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGGCTC GCGCTGCTGT TTCGCGCCTC CTCATTATTG TCCTGGTTCT CCTTTCTTCG 
TGTGTGGGTG ATTCTGGCAG AATCGCTGTG CATGCATCAG GAGTTCTTGC CGCTTCTCCT
GAGTCAATCG GTGAAACGCT GGCGTCGGGT CAGCAGGTCA CGCGACCGCT GGTTATCACG
AATACCGGGA CAACCCCGAT CACCGCCTTG CTGTACGAAG CGTATGCGCA ACCATCGCTG
GCAATGGCGC ATGCCTGGGG TCCTGCCAGC GTACCATTGC CACAGCAGGA ACAATCGCTC
GACCCGCGCC TGGCAACGCA ACTGGACGAA CCCTCCGAGC GTGGTGCGTT CATTGTCTAT
TTGCACGACC AGGCAGACCT GAGCGCGGCG TATGGCATAA CCGACTGGGG GGAACGCGGG
TGGTTCGTTT ACCGCACGCT GGTCGAACAC GCGGAGCGCA CCCAGCGTGC TCTGCGCGCT
GAACTGTCGG CGCGCGGTCT GGCATACCGA CCATTCTGGA TTGTCAATGC GGTCTATGTC
GAAGGAACGC TCACCGATGC GCAGGCGCTG GAACGACGCG CCGACGTTGC GCTGGTTCGC
ACCGATGCGC GTGTGGCGGT TGCGCCGCAG GTGTCGGCGC CAGCGAGTCT CGATGAGCGT
TGCAGTTCGG ACGGTAATCC GGTGTGCTGG AATATCCGGG CGATTGGCGC CGACCGGGTC
TGGAATGAGT TTGGCATCAC CGGTCAGGGA GTCACCGTTG CGTCAATCGA CACCGGCGTG
TTGGGGATCC ATCCGGCGCT GCGCGACCGG TATCGCGGCG CGCTTGGCGG CGGAATGTAC
GATCACAACT ACAACTGGTA CGATCCGCAG GGTGTGTTTC CGATGCCGGT CGATCAGAAC
GGGCACGGCA CCCACACCAC CGGCACGATC GTCGGCAGTC GACCGGGCGG TGAGCGGTTC
GGCGTTGCGC CGGGCGCACG GTGGATTGCT GCGCAGGGGT GCGATGGGTC ATTCTGTAGC
GAAAGCGATC TGTTTGCCGC CGCGCAGTGG ATCCTGGCGC CGACCGATCT CAACGACCGC
AATCCGCGAC CGGACCTTCG CCCGATGATT GTCAACAACT CATGGGCAGG CGGCAGCAAC
GACCCATGGT ATGCCGGCTA TACCGCCGCC TGGCGCGCAG CGGGCATCTT TCCGGTCTTC
GCGGCGGGCA ACGGCGTCGG CGCCTGCCGC ACAATTGCGT CTCCAGGCGA TTATGCCGAT
GTGGTGGCAG TCGGCGCCAC CAATCGCAGC GGTTCGATTG CATCATTCAG TCTGCGCGGT
CCTGCCGCCG ATGGGCGGAT GAAGCCGGAC TTCGTCGCTC CCGGTGATGG CGGCATTTAC
TCTGCATCGC TCAATGACGG GTACACAACA CTGCGCGGCA CATCAATGGC GACTCCGCAT
GTCGCCGGGG TGGCTGCGCT GCTTTACGCT GCCAATCCGG CGCTGATCGG CGATTACGAC
GCAACCTATG CCATCCTGCG CGACACGGCG CGGCGGCGAG ATGATCCGCA GTGTGGCGTC
GTAGCAGGGG GCGGGAACAA TGTCTACGGG TGGGGGTTGA TCGACGCACA CGCGGCTGTT
GCCCGCGCGC GCGTTGATGT TCCCTGGTTG CGCCTCCCGA CGACGACCCT GAACCTTGAT
CCCGGTCAAA GCGCATCCGT CAACGTGACC CTCGACGCCA GCGGTGTTGC AACACCCGGA
ATCTACACGG CGCGCATCCA GATCTATGCC GGTGATCTGA CCCTTCCGCC GGTAACAGTG
ACGGTCACAA TGGTCGTGAC CGGTGCTGGC GGTACACTGG TGACCGGCAT TGTGCGTGAT
GCAGAGACCG GTACGCCGCT TGCGGCAACA GTCCGCACTG AGAACGGCGC GAGCACAACG
ACTGCGGCTG ACGGAACATT CGCACTGGTG CTGACCACGG GTGTCCATAC CCTGACCGCT
TCCGCTCGCT CGTATGCACC CCGGCAACGC ACGATCACCG TGCCGACCGA CGGATCGGTC
GATTTTGGGT TGCTGCTCGA TGCGCCGCAG GTTGCATTAT CGACCGATTA CGTGACAGCC
ACACTCGATT TCAACACCAG TGTCACGCGC ACGGTAACGA TCACGAACAC CGGCACACGT
CCATTGACGT TCGAAGCGAA GGTCGGGTAT GCGCCATTCG GCATCTATCG CAGTGACGAG
CCGGGGGGAC CGGTGTACCG GTGGATCGAC CTGCCACCTG ATGCACCAAC CCTGGAACTG
ACGAACACAA CCCGGATCGA TGGCGTGCCA TTGGGACTGA CATTCCCGCT CTATACCTAC
ACGGTTACCG AGACCTCCAT CACGTCGGAT GGGACGCTGA TGTTTGACTG GCCCTATCCG
TACACCGGTC TGCTGGAGCG TTGTCTGCCG GCGACTGAAG CATTCTTCAA CCTGCTTGCG
CCATTCCGCA CCGACCTTGA CCCGTCGCGC GGCGGAATTG TGCGTTATGG AACGGTCAAC
AACGGCACAA CGTTTGTGGT CAGTTTCGAG GATGTACCGA CCGCCGCAGG TCCGCCGGAT
CACACCTTCA CGTTCCAGAC GTTGCTTCAC CAGGATGGCC GGATTGTCTA CCAGTACCGC
GATCTGGGAG CGTTGCCGGA GCGTCTCAGT GTTGGGATTC AGAAAAGCCT GAACCAGGTG
CAACGGATCG GTTGCGGCGC CGATACGCCG ATCGCTCCGG GACTGGCCAT CGAGTTCAGA
CCACAGTTCC GTCCGGAGGG TTGGGTGATC GTGAACCCGG AGAAAGGGTC GCTTGAACCG
GGCGCCAGCA CAAACCTGAC GTTCACCTAC TTCTGGCAGG CGCCGCCGCA GGGGAACCAT
CTCCGCACAA CCATTGTCAT CTCCAGCAGC GATCCGCGTC GCCGCACAGC AACGATTATG
ACTGAGGCTG CGATGCGTCC GGCGCCGCAT GTGGTATGGC TGGGCATCGT GGCGCGGTAA
 
Protein sequence
MSARAAVSRL LIIVLVLLSS CVGDSGRIAV HASGVLAASP ESIGETLASG QQVTRPLVIT 
NTGTTPITAL LYEAYAQPSL AMAHAWGPAS VPLPQQEQSL DPRLATQLDE PSERGAFIVY
LHDQADLSAA YGITDWGERG WFVYRTLVEH AERTQRALRA ELSARGLAYR PFWIVNAVYV
EGTLTDAQAL ERRADVALVR TDARVAVAPQ VSAPASLDER CSSDGNPVCW NIRAIGADRV
WNEFGITGQG VTVASIDTGV LGIHPALRDR YRGALGGGMY DHNYNWYDPQ GVFPMPVDQN
GHGTHTTGTI VGSRPGGERF GVAPGARWIA AQGCDGSFCS ESDLFAAAQW ILAPTDLNDR
NPRPDLRPMI VNNSWAGGSN DPWYAGYTAA WRAAGIFPVF AAGNGVGACR TIASPGDYAD
VVAVGATNRS GSIASFSLRG PAADGRMKPD FVAPGDGGIY SASLNDGYTT LRGTSMATPH
VAGVAALLYA ANPALIGDYD ATYAILRDTA RRRDDPQCGV VAGGGNNVYG WGLIDAHAAV
ARARVDVPWL RLPTTTLNLD PGQSASVNVT LDASGVATPG IYTARIQIYA GDLTLPPVTV
TVTMVVTGAG GTLVTGIVRD AETGTPLAAT VRTENGASTT TAADGTFALV LTTGVHTLTA
SARSYAPRQR TITVPTDGSV DFGLLLDAPQ VALSTDYVTA TLDFNTSVTR TVTITNTGTR
PLTFEAKVGY APFGIYRSDE PGGPVYRWID LPPDAPTLEL TNTTRIDGVP LGLTFPLYTY
TVTETSITSD GTLMFDWPYP YTGLLERCLP ATEAFFNLLA PFRTDLDPSR GGIVRYGTVN
NGTTFVVSFE DVPTAAGPPD HTFTFQTLLH QDGRIVYQYR DLGALPERLS VGIQKSLNQV
QRIGCGADTP IAPGLAIEFR PQFRPEGWVI VNPEKGSLEP GASTNLTFTY FWQAPPQGNH
LRTTIVISSS DPRRRTATIM TEAAMRPAPH VVWLGIVAR