Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RoseRS_1092 |
Symbol | |
ID | 5208039 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | - |
Start bp | 1356259 |
End bp | 1359573 |
Gene Length | 3315 bp |
Protein Length | 1104 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640594706 |
Product | peptidase S41 |
Protein accession | YP_001275450 |
Protein GI | 148655245 |
COG category | [S] Function unknown |
COG ID | [COG4946] Uncharacterized protein related to the periplasmic component of the Tol biopolymer transport system |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.588085 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.0332466 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCACGC ACGGATATTA TCGCTGGCCA ACCATTCATG ACGACACCGT TGTCTTTGTT TGTGAAGATG ATCTCTGGTT GGTTGCCGCA TCGGGCGGCG TGGCACGACG GTTGACCGCG AACCCTGGCA GTGTGCAGTC GCCAGCGCTC TCCCCCGATG GTACATTGCT GGCATTTGTC GGACGCGATG AGGGTCCGGG GGAAGTCTTT GTGATGCTGG CAGTGGGCGG TGAAGCGCGT CGGCTGACGT TTCTCGGTGA AACAATGCGC GTATGCGGAT GGAGTCGCAA TGGGCGCGAT ATTCTGTTTG CCAGTTCTGC GCATTCACCG TTTTCGCGAT CTCCCCTGCT GTACGCTGTC GCTGCCGATG GCGGCGAGCC GCGCCTTCTT CCGACCGGTC CGGCGGTTCA CGTGTCGTAT GGACCAGACG GCGGCATGGT CATCGGGCGC AATGAGAGCG ACCCGGCGCG CTGGAAGCGG TATCGCGGCG GACGCACCGG CGATGTGTGG ATCGACCCGG ATGGCAGCGG CGAGTGGCGG CGTCTGATCT CGCTCCCCGG CAATATCGCC ATTCCGCTGT GGGTTGGCGA CCGGATCTAT TTTGTGTCCG ATCACGAAGG CGTCGGAAAT CTCTATTCAT GCCTGCCGAC CGGCGAGGAC CTGCAACGCC ATACCTGGCA CCGCGAATAC TATGCGCGTT TTCCGTCCAC CGATGGACGG CGGATCGTCT ACCATGCTGG CGCGGATCTC TACCTGTTTG ATCCGGCAAC GAATGGATCG CGCAAGATCG AGATCGAACT GCACAGCCCG CGAACGCAGC GAAAGCGTCG TTTTGTCGAT CCGGCGCGTT TTCTTCAAAG TGTTGCACTG CATCCGGAGG GGCACTCGCT CGTCGCCGTC GTTCGCGGCA AGCCGTTTAC ATTCGGCAAC TGGGAAGGGG CTGTGTTGCA GTACGGCGAT CCTGGCGCAG TGCGCTATCG CCTGGCTGAC TGGTTGCCCG ACGGCAGGCG GATTGTGGTG GTAAGTGATG CCGCAGGCGA AGAGATGCTG GAAGTCCACC CGGTCACATT GGGCAATGGT CAGGTCGCTC CCAGAACGGA CGTCGCGGAT GTCCAGCCTG GAACGGGATC ATCGACATTG CTGTTTGAGG AACCGGTGCG CCTGGACGGA CTCGATATCG GTCGTCCTCT GACGCTCGCC GTCTCACCCA AAGCGCCGCT TGTCGCGCTT GCGAATAACC GGAATGAATT GCTGCTGGTC GATCTGAATG ATCGCTCCGT GCGGCTGCTT GATCGCAGTC GATATGCCTC TATGCCCGGC ATCGCCTGGT CGCCAGATGG ACGCTGGCTT GCGTATGGCT TTTGGGAAAC GGAGCAGACA TCGGTTATTA AACTGTGCGA GATCGCCACC GGGACGATCA CCCCGGTCAC GCGACCGGTG CTGGTCGATC GATCTCCGGC GTTCGATCCA GAGGGAAAGT ATCTCTATTT TATTTCATAC CGCGATCTCG ATCCGGTGCG TGATGATATT CATTTCGACC TGGGATTTCC CCGCGGTGCG CGTCCATTTC TGGTGACGTT GCGCGCCGAT CTGCGTTCGC CGTTTGTGCC GGGTCCGCAT CCGCTGGAAC GACCGACGGC GAAGCCTGCT TCAGGTGAAG CGTCGTCGGG TCAGGAAGAA GCCACTGCTC CGAAAGAGGC GTCGTCCGAG AAAAGCGTCG TGATCGATCT CGAAGGCATC GCCGACCGGA TTGTCGCGTT TCCCGTACCG GTTGGGCGGT ATGGGCAGAT CGCGGGGATA CCGGGAAAGG CGCTCTTTAC TGTTTTTCCA ATCGAAGGCA TGCTGAGTCA GGCGCACATG TCGGGCAGTG CGTCAGCGAG TCGCGGGCGT CTCGATGTCT ACGATTTCGA GACCCTGAGT AGCGACACGT TGATCGATGG CGTCTCGCGC TTTGCCCTTT CACGCGATGC GAAGACGCTG ATCTACCGTT CCGGCAATCG GGTGCGCGTT GTGAGAGCAG GCGAGAAACC GAAGGATAAC AGCCCTGAGC CTGGACGGAA GAGCGGATGG ATCGATCTCG CGCGCATCAA ACTGCTGGTC TCGCCGCCGG CGGAGTGGAG GCAGATGTAC CGCGAAGCCT GGCGTCTCCA GCGCGATCAT TTCTGGACGC CGGATATGTC GGGAGTCAAC TGGCTGGCGG TCTATCAGCG CTACCTGCCG TTGCTTGATC GGGTTGCAAC GCGCGGCGAA TTTTCCGATC TGCTGTGGGA GATGCAGGGC GAACTGGGAA CATCGCATGC CTACGAATAT GGTGGTGATT ACCGTCCTGA GCCGCGCTAC AGCCCAGGCA GACTGGGCGC AGATCTGCGC TACGACGCCG AAACCGACAG TTATGTGGTC GAGCGAGTGA TCCGGGGTGA TGTATGGGAC GAGCGCGCCA GTTCGCCGCT GGCGCAGCCA GGGATCAACA TCGTGCCCGG CGACCGCCTG ATCGCAGTCG GCGGGCATCG GGTCGGGCGA AACGTATCGC CGCACGAATT GCTGATCAAC CAGGCGGGCA GCGATGTGTT GTTGACCTTT ATGAAGATGG ACGGTACGCT TCGATCGGTG ACCGTTAAGG CGCTCTACGA CGAGAGTCGC GCGCGCTATC GGGAATGGGT CGAACGGAAC CGGCAGATCG TCCACGACGC AACGCAGGGG CGCGTCGGGT ATCTCCATAT CCCCGATATG CAGGCACACG GGTATGCCGA GTTCCACCGC GGCTTTCTTG CCGGGGTGGT GTATGAAGGG TTGATCGTCG ACCTGCGGTA TAATACGGGC GGCTTCGTTT CGCCGTTAAT CGTCGAAAAA CTGGCGCGAA AGCGCCTCGG ATACGGTGTT TCACGCTGGG GCGAACCCGA ACCCTACCCG CCGGAGTCGG TAATGGGACC AATGGTGGCG ATCATTAACG AAGCGGCCGG ATCCGACGGC GATATCATCA GCCACGTGTT CAAAATGATG AAACTCGGTC CGCTGATCGG CAAGCGCACC TGGGGCGGGG TCATCGGCAT CTATCCACGC GATACCCTGA TCGACGGCGG TGTGACCACG CAACCGGAGT TTTCCTTCTG GTCGGCGGAG GCGGGCTGGC AACTGGAGAA TCGCGGCGTT GAACCGGATA TTGAAGTCGA AATGCGACCA CAGGATTACG TTGCGGGCGT CGATCCGCAA CTCGAGCGCG CGATTGCCGA AGTGCTGCGC CTGATGCAGG ATCACGCGCC CAAACTTCCC GATTTCGGCG AACGACCACG CCTGCCCTTG CCGGAGGAAC GTTGA
|
Protein sequence | MSTHGYYRWP TIHDDTVVFV CEDDLWLVAA SGGVARRLTA NPGSVQSPAL SPDGTLLAFV GRDEGPGEVF VMLAVGGEAR RLTFLGETMR VCGWSRNGRD ILFASSAHSP FSRSPLLYAV AADGGEPRLL PTGPAVHVSY GPDGGMVIGR NESDPARWKR YRGGRTGDVW IDPDGSGEWR RLISLPGNIA IPLWVGDRIY FVSDHEGVGN LYSCLPTGED LQRHTWHREY YARFPSTDGR RIVYHAGADL YLFDPATNGS RKIEIELHSP RTQRKRRFVD PARFLQSVAL HPEGHSLVAV VRGKPFTFGN WEGAVLQYGD PGAVRYRLAD WLPDGRRIVV VSDAAGEEML EVHPVTLGNG QVAPRTDVAD VQPGTGSSTL LFEEPVRLDG LDIGRPLTLA VSPKAPLVAL ANNRNELLLV DLNDRSVRLL DRSRYASMPG IAWSPDGRWL AYGFWETEQT SVIKLCEIAT GTITPVTRPV LVDRSPAFDP EGKYLYFISY RDLDPVRDDI HFDLGFPRGA RPFLVTLRAD LRSPFVPGPH PLERPTAKPA SGEASSGQEE ATAPKEASSE KSVVIDLEGI ADRIVAFPVP VGRYGQIAGI PGKALFTVFP IEGMLSQAHM SGSASASRGR LDVYDFETLS SDTLIDGVSR FALSRDAKTL IYRSGNRVRV VRAGEKPKDN SPEPGRKSGW IDLARIKLLV SPPAEWRQMY REAWRLQRDH FWTPDMSGVN WLAVYQRYLP LLDRVATRGE FSDLLWEMQG ELGTSHAYEY GGDYRPEPRY SPGRLGADLR YDAETDSYVV ERVIRGDVWD ERASSPLAQP GINIVPGDRL IAVGGHRVGR NVSPHELLIN QAGSDVLLTF MKMDGTLRSV TVKALYDESR ARYREWVERN RQIVHDATQG RVGYLHIPDM QAHGYAEFHR GFLAGVVYEG LIVDLRYNTG GFVSPLIVEK LARKRLGYGV SRWGEPEPYP PESVMGPMVA IINEAAGSDG DIISHVFKMM KLGPLIGKRT WGGVIGIYPR DTLIDGGVTT QPEFSFWSAE AGWQLENRGV EPDIEVEMRP QDYVAGVDPQ LERAIAEVLR LMQDHAPKLP DFGERPRLPL PEER
|
| |