Gene RoseRS_3776 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_3776 
Symbol 
ID5210758 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp4724393 
End bp4726057 
Gene Length1665 bp 
Protein Length554 aa 
Translation table11 
GC content60% 
IMG OID640597372 
Productpeptidase S1 and S6, chymotrypsin/Hap 
Protein accessionYP_001278080 
Protein GI148657875 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG5640] Secreted trypsin-like serine protease 
TIGRFAM ID[TIGR01451] conserved repeat domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000930307 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000981253 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGCACACAC GTCTTCAACA CATCATCCTC ATCATTGGTC TGTTCTCGCT GTCTGGTTTT 
GTTTTGTCCG CCATAGGGGC GCGCGCTGCA GAACCGCCAT CCACCATTCC GCCGAATGTC
GTCGGCGGCA CACCGGTTGC ACCGGGGGAT TACCCATGGC TCGTCGGGTT GCTCAATGCC
AGCGTGCAGG ATGAGGCTGC CGTCTTCTGC GGCGGTGCGC TGATCGACGA CGGCGCTCCG
ACCGCCAGTT CCCAATGGGT GCTTACGGCA GCGCACTGTC TGGTTATTAA CGGTGAGGTT
GTGTCGCCGT CAGCGATCGA GGTGCTTGCC GGTCAGCCTG ATTTGACGCA GGTTCAGCCG
GAACAACGGC ATCCGGTTGC AGACATTATT GTGCATCCGC TTTATATCTA CGGATATGCG
CCTGTCAACG ATATTGCGTT GCTCCGCCTG GCTGCACCGG TGAACGTGGG GAACACCCTG
CCTGTCGCCA CTCCCGCTGA TGCTGCATTC TTTGCACCCG GCGTCGATGC GCAGATCGCC
GGGTGGGGCA ATCTCCTGCC GCAGACCGGC GTTCAGCAAC CGGACATTGC ACACAAGGCG
GTCGTCAAGA TCGTGGATGA CGCCACATGC AATGCGCGGT ATGATCGCGC GCTGGGTAGT
GAGCATCTCT GCGCGGGCAA TATGCCGGAT GGCGGCGTTG ACACCTGTCA GGGCGACAGC
GGCGGTCCGC TTATGGTAGT GAAAGGCAGC ACCCTGATCC ACGCCGGTAT TGTCAGTTTT
GGTCAGGGGT GCGCCTGGCC CCATTTCCCC GGCGTTTATG CGCGCACAGC AACCTATGCT
GGCTGGATCA ATGCGGTCAT CAACGACCAA CCGCACGTTG ATGTCATACA GGAGGGTCCG
TATCCTATCT GGCCCGGTGA GGATGTTCCC GACCCCACGT TGTTCCATAT CTCGACTGCT
GCGCCAGGCG AGCCGTTTAC GTATACGCTA CGGGTCGCCA ACACCGGTAT GCAGGTCCTC
GATACGCTGA CCATTACTGC CACATTGCCC GGCGGCGCCA GTCTGATCGC CATTAACGAC
GGTGGAATCG TCGCAGATCC TGTTATCACC TGGACTGCGA CCGATCTGGC GCCCGGCGAG
GTGATCGAGC GCTCGTATGT CGTCTCTGCC ACCACAAGCG TCACGACCGG TCCGTATGGG
GTGATGGCGA TCAGCGGCTC AACAACGGTA ACGTCGTCCG GGCGTTTGCC GATCACGACG
CTGATCAATA CACCGCGGCT GCACCTTCGC ACATTTGCTG AATCAGAGGT GGAAGTCGGA
TCGGTCTTTG GACAACTCTT CCTGCTGGTC AATTTTGGGC AGGGTAATGG AGCGGGCGTG
ACAACGCCCA TAGATGTGCG CGCAAAGATT CCCCCTGGCG CCGGCATTGT GGCTATCAAT
AGTGGATCGA TCGTTGGCGA CGAGGCGCGC TGGCAGATTT CAGGGTTAAG CGGGGGGGGT
TCCGTGTTTC TGGGTATGGA TGTGCGCGCC GGAGCTGTTG GCAGCGTCTT CCGCATCACC
GATTACCGGG CGCAGATCGG CAGTGCGACA CCCACTCTCG GCGCAACCGC AAGCCAGACA
GTCGTCAACG AAGCACGCCG TTATCTCCCC CTGATAGCCC GCTGA
 
Protein sequence
MHTRLQHIIL IIGLFSLSGF VLSAIGARAA EPPSTIPPNV VGGTPVAPGD YPWLVGLLNA 
SVQDEAAVFC GGALIDDGAP TASSQWVLTA AHCLVINGEV VSPSAIEVLA GQPDLTQVQP
EQRHPVADII VHPLYIYGYA PVNDIALLRL AAPVNVGNTL PVATPADAAF FAPGVDAQIA
GWGNLLPQTG VQQPDIAHKA VVKIVDDATC NARYDRALGS EHLCAGNMPD GGVDTCQGDS
GGPLMVVKGS TLIHAGIVSF GQGCAWPHFP GVYARTATYA GWINAVINDQ PHVDVIQEGP
YPIWPGEDVP DPTLFHISTA APGEPFTYTL RVANTGMQVL DTLTITATLP GGASLIAIND
GGIVADPVIT WTATDLAPGE VIERSYVVSA TTSVTTGPYG VMAISGSTTV TSSGRLPITT
LINTPRLHLR TFAESEVEVG SVFGQLFLLV NFGQGNGAGV TTPIDVRAKI PPGAGIVAIN
SGSIVGDEAR WQISGLSGGG SVFLGMDVRA GAVGSVFRIT DYRAQIGSAT PTLGATASQT
VVNEARRYLP LIAR