Gene RoseRS_1092 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_1092 
Symbol 
ID5208039 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp1356259 
End bp1359573 
Gene Length3315 bp 
Protein Length1104 aa 
Translation table11 
GC content60% 
IMG OID640594706 
Productpeptidase S41 
Protein accessionYP_001275450 
Protein GI148655245 
COG category[S] Function unknown 
COG ID[COG4946] Uncharacterized protein related to the periplasmic component of the Tol biopolymer transport system 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.588085 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0332466 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACGC ACGGATATTA TCGCTGGCCA ACCATTCATG ACGACACCGT TGTCTTTGTT 
TGTGAAGATG ATCTCTGGTT GGTTGCCGCA TCGGGCGGCG TGGCACGACG GTTGACCGCG
AACCCTGGCA GTGTGCAGTC GCCAGCGCTC TCCCCCGATG GTACATTGCT GGCATTTGTC
GGACGCGATG AGGGTCCGGG GGAAGTCTTT GTGATGCTGG CAGTGGGCGG TGAAGCGCGT
CGGCTGACGT TTCTCGGTGA AACAATGCGC GTATGCGGAT GGAGTCGCAA TGGGCGCGAT
ATTCTGTTTG CCAGTTCTGC GCATTCACCG TTTTCGCGAT CTCCCCTGCT GTACGCTGTC
GCTGCCGATG GCGGCGAGCC GCGCCTTCTT CCGACCGGTC CGGCGGTTCA CGTGTCGTAT
GGACCAGACG GCGGCATGGT CATCGGGCGC AATGAGAGCG ACCCGGCGCG CTGGAAGCGG
TATCGCGGCG GACGCACCGG CGATGTGTGG ATCGACCCGG ATGGCAGCGG CGAGTGGCGG
CGTCTGATCT CGCTCCCCGG CAATATCGCC ATTCCGCTGT GGGTTGGCGA CCGGATCTAT
TTTGTGTCCG ATCACGAAGG CGTCGGAAAT CTCTATTCAT GCCTGCCGAC CGGCGAGGAC
CTGCAACGCC ATACCTGGCA CCGCGAATAC TATGCGCGTT TTCCGTCCAC CGATGGACGG
CGGATCGTCT ACCATGCTGG CGCGGATCTC TACCTGTTTG ATCCGGCAAC GAATGGATCG
CGCAAGATCG AGATCGAACT GCACAGCCCG CGAACGCAGC GAAAGCGTCG TTTTGTCGAT
CCGGCGCGTT TTCTTCAAAG TGTTGCACTG CATCCGGAGG GGCACTCGCT CGTCGCCGTC
GTTCGCGGCA AGCCGTTTAC ATTCGGCAAC TGGGAAGGGG CTGTGTTGCA GTACGGCGAT
CCTGGCGCAG TGCGCTATCG CCTGGCTGAC TGGTTGCCCG ACGGCAGGCG GATTGTGGTG
GTAAGTGATG CCGCAGGCGA AGAGATGCTG GAAGTCCACC CGGTCACATT GGGCAATGGT
CAGGTCGCTC CCAGAACGGA CGTCGCGGAT GTCCAGCCTG GAACGGGATC ATCGACATTG
CTGTTTGAGG AACCGGTGCG CCTGGACGGA CTCGATATCG GTCGTCCTCT GACGCTCGCC
GTCTCACCCA AAGCGCCGCT TGTCGCGCTT GCGAATAACC GGAATGAATT GCTGCTGGTC
GATCTGAATG ATCGCTCCGT GCGGCTGCTT GATCGCAGTC GATATGCCTC TATGCCCGGC
ATCGCCTGGT CGCCAGATGG ACGCTGGCTT GCGTATGGCT TTTGGGAAAC GGAGCAGACA
TCGGTTATTA AACTGTGCGA GATCGCCACC GGGACGATCA CCCCGGTCAC GCGACCGGTG
CTGGTCGATC GATCTCCGGC GTTCGATCCA GAGGGAAAGT ATCTCTATTT TATTTCATAC
CGCGATCTCG ATCCGGTGCG TGATGATATT CATTTCGACC TGGGATTTCC CCGCGGTGCG
CGTCCATTTC TGGTGACGTT GCGCGCCGAT CTGCGTTCGC CGTTTGTGCC GGGTCCGCAT
CCGCTGGAAC GACCGACGGC GAAGCCTGCT TCAGGTGAAG CGTCGTCGGG TCAGGAAGAA
GCCACTGCTC CGAAAGAGGC GTCGTCCGAG AAAAGCGTCG TGATCGATCT CGAAGGCATC
GCCGACCGGA TTGTCGCGTT TCCCGTACCG GTTGGGCGGT ATGGGCAGAT CGCGGGGATA
CCGGGAAAGG CGCTCTTTAC TGTTTTTCCA ATCGAAGGCA TGCTGAGTCA GGCGCACATG
TCGGGCAGTG CGTCAGCGAG TCGCGGGCGT CTCGATGTCT ACGATTTCGA GACCCTGAGT
AGCGACACGT TGATCGATGG CGTCTCGCGC TTTGCCCTTT CACGCGATGC GAAGACGCTG
ATCTACCGTT CCGGCAATCG GGTGCGCGTT GTGAGAGCAG GCGAGAAACC GAAGGATAAC
AGCCCTGAGC CTGGACGGAA GAGCGGATGG ATCGATCTCG CGCGCATCAA ACTGCTGGTC
TCGCCGCCGG CGGAGTGGAG GCAGATGTAC CGCGAAGCCT GGCGTCTCCA GCGCGATCAT
TTCTGGACGC CGGATATGTC GGGAGTCAAC TGGCTGGCGG TCTATCAGCG CTACCTGCCG
TTGCTTGATC GGGTTGCAAC GCGCGGCGAA TTTTCCGATC TGCTGTGGGA GATGCAGGGC
GAACTGGGAA CATCGCATGC CTACGAATAT GGTGGTGATT ACCGTCCTGA GCCGCGCTAC
AGCCCAGGCA GACTGGGCGC AGATCTGCGC TACGACGCCG AAACCGACAG TTATGTGGTC
GAGCGAGTGA TCCGGGGTGA TGTATGGGAC GAGCGCGCCA GTTCGCCGCT GGCGCAGCCA
GGGATCAACA TCGTGCCCGG CGACCGCCTG ATCGCAGTCG GCGGGCATCG GGTCGGGCGA
AACGTATCGC CGCACGAATT GCTGATCAAC CAGGCGGGCA GCGATGTGTT GTTGACCTTT
ATGAAGATGG ACGGTACGCT TCGATCGGTG ACCGTTAAGG CGCTCTACGA CGAGAGTCGC
GCGCGCTATC GGGAATGGGT CGAACGGAAC CGGCAGATCG TCCACGACGC AACGCAGGGG
CGCGTCGGGT ATCTCCATAT CCCCGATATG CAGGCACACG GGTATGCCGA GTTCCACCGC
GGCTTTCTTG CCGGGGTGGT GTATGAAGGG TTGATCGTCG ACCTGCGGTA TAATACGGGC
GGCTTCGTTT CGCCGTTAAT CGTCGAAAAA CTGGCGCGAA AGCGCCTCGG ATACGGTGTT
TCACGCTGGG GCGAACCCGA ACCCTACCCG CCGGAGTCGG TAATGGGACC AATGGTGGCG
ATCATTAACG AAGCGGCCGG ATCCGACGGC GATATCATCA GCCACGTGTT CAAAATGATG
AAACTCGGTC CGCTGATCGG CAAGCGCACC TGGGGCGGGG TCATCGGCAT CTATCCACGC
GATACCCTGA TCGACGGCGG TGTGACCACG CAACCGGAGT TTTCCTTCTG GTCGGCGGAG
GCGGGCTGGC AACTGGAGAA TCGCGGCGTT GAACCGGATA TTGAAGTCGA AATGCGACCA
CAGGATTACG TTGCGGGCGT CGATCCGCAA CTCGAGCGCG CGATTGCCGA AGTGCTGCGC
CTGATGCAGG ATCACGCGCC CAAACTTCCC GATTTCGGCG AACGACCACG CCTGCCCTTG
CCGGAGGAAC GTTGA
 
Protein sequence
MSTHGYYRWP TIHDDTVVFV CEDDLWLVAA SGGVARRLTA NPGSVQSPAL SPDGTLLAFV 
GRDEGPGEVF VMLAVGGEAR RLTFLGETMR VCGWSRNGRD ILFASSAHSP FSRSPLLYAV
AADGGEPRLL PTGPAVHVSY GPDGGMVIGR NESDPARWKR YRGGRTGDVW IDPDGSGEWR
RLISLPGNIA IPLWVGDRIY FVSDHEGVGN LYSCLPTGED LQRHTWHREY YARFPSTDGR
RIVYHAGADL YLFDPATNGS RKIEIELHSP RTQRKRRFVD PARFLQSVAL HPEGHSLVAV
VRGKPFTFGN WEGAVLQYGD PGAVRYRLAD WLPDGRRIVV VSDAAGEEML EVHPVTLGNG
QVAPRTDVAD VQPGTGSSTL LFEEPVRLDG LDIGRPLTLA VSPKAPLVAL ANNRNELLLV
DLNDRSVRLL DRSRYASMPG IAWSPDGRWL AYGFWETEQT SVIKLCEIAT GTITPVTRPV
LVDRSPAFDP EGKYLYFISY RDLDPVRDDI HFDLGFPRGA RPFLVTLRAD LRSPFVPGPH
PLERPTAKPA SGEASSGQEE ATAPKEASSE KSVVIDLEGI ADRIVAFPVP VGRYGQIAGI
PGKALFTVFP IEGMLSQAHM SGSASASRGR LDVYDFETLS SDTLIDGVSR FALSRDAKTL
IYRSGNRVRV VRAGEKPKDN SPEPGRKSGW IDLARIKLLV SPPAEWRQMY REAWRLQRDH
FWTPDMSGVN WLAVYQRYLP LLDRVATRGE FSDLLWEMQG ELGTSHAYEY GGDYRPEPRY
SPGRLGADLR YDAETDSYVV ERVIRGDVWD ERASSPLAQP GINIVPGDRL IAVGGHRVGR
NVSPHELLIN QAGSDVLLTF MKMDGTLRSV TVKALYDESR ARYREWVERN RQIVHDATQG
RVGYLHIPDM QAHGYAEFHR GFLAGVVYEG LIVDLRYNTG GFVSPLIVEK LARKRLGYGV
SRWGEPEPYP PESVMGPMVA IINEAAGSDG DIISHVFKMM KLGPLIGKRT WGGVIGIYPR
DTLIDGGVTT QPEFSFWSAE AGWQLENRGV EPDIEVEMRP QDYVAGVDPQ LERAIAEVLR
LMQDHAPKLP DFGERPRLPL PEER