Gene Information |
Locus tag | RoseRS_1513 |
Symbol | |
ID | 5208468 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | + |
Start bp | 1842144 |
End bp | 1845494 |
Gene Length | 3351 bp |
Protein Length | 1116 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640595121 |
Product | protease domain-containing protein |
Protein accession | YP_001275857 |
Protein GI | 148655652 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
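The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based inclusive coordinates and that the Protein Length excludes the terminal stop codon; neither convention is stated in the record, but both match its numbers. The function names `gene_length` and `protein_length` are illustrative only, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for locus tag RoseRS_1513.
length = gene_length(1842144, 1845494)
print(length)                  # 3351
print(protein_length(length))  # 1116
```

Both results agree with the Gene Length (3351 bp) and Protein Length (1116 aa) fields listed in the record.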
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.0327061 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGACACC TCGCACGCTA TGCTCGAATC ACGGCGTTCA TGCTCGTACT CGTGGTCGCG TTGGCGTCTG TGGCGCCGAT GCGCAACGTT GCAGCCCAAA CCCGACCAAA CCGTCCCACG CCTGGCACGC CTCTGACGAT CGAACCGGTA ACCGTCGTCA GTCGTGAAGA CCTTGCCGTC AACAAGAATC TGGCCGTGTC GCGCGATGGC ACCCTGGCGA GCGTTTTCAT CAAGATCGAC TCGCCATCGC TGGCATCATA CATGGCGCAG AATGGCATCA CCGACATTAA TGCGCCAGCT GCTCAGAGTT ACCTGCAGCA GTTGAACGCG GAACTTGATG CGCTGGTTGC GCAGGCGAAG CAGCGCGTTC CCGGCTTGCG CGTCACCCAT CGCTTCGATC TGATCATCGG CGGTGTCTCT GTTGTGGCGC CGGTCGGTGA GATTGACAAA CTGCGACGCC TGCCGAATGT GGTGGAGATT ATCAACGACC GCATCGAGCG GATCGAAACC TACCGTACAC CGGCATTCAT CGGCGCAACA ACCGCCTGGG GCAAGGGTGG CGGCTCCGCC TTCGCTGGCG AAGGGGTCAT TTTTGGTGTG CTCGATAGCG GCGTCTGGCC CGAACATCCG TCGTTCTCCG ATCCTGATCC GCTGGGGAAA CCGTATGCGC CGCCGCCACC CGCACCCGGC AATCCCGGCG GCGTTCGTGC CTGCAATTTC GGCAGTGCGA CGCCTGGTGA TGCACCTTTC ACCTGCAACA ACAAACTGAT CGGTTCATAC CGCTTCATGA CGGCCTACGA CTTCTTCGTC GGTACGGAAC CGTATGAATT TCGGTCTGGC CGTGATGACG ACGGTCACGG CACCCATACC GCTTCCACGG CCGCCGGTAA CCGTGGCGTT CCAGCCAGCG ATGGGAGCCG CGTTTTCGGT ACTATCTCCG GCATCGCACC GCGCGCATAT GTTGTCAATT ATAAGGTGTG CGGTGAAGTA GGCTGCTTTA CAACCGACTC GGCAGCTGCG GTGCAGCAGG CAATCCGTGA TGGCGTCCAC GTCATCAACT TTTCGATCAG TGGCGGAACC AATCCGTACA GCGACATCGC CTCACTCGCC TTCCTCGACG CCTATAATGC CGGTATTCTG GTTTCTGCCT CGGCGGGCAA CTCCGGTCCG GCAGCCGACA CGGTCAACCA CCGCGAACCC TGGGTTGCGA CCGTTGGCGC CAGCACGTCG GATCGGTCGT ACCTCAGCAC GCTGACGGTT CAGGGAGTAA GTGGTACGTT CACTGCTGTT GGCGCTTCCA GTGGCGCCGG GATTTCCACG CCCGCGCCGA TTGTGGTGAA CACGGCTGAT CCACTGTGCC AGAACCCGGC GCTCCCAGGA ACATTTACGG GGAAGATTGT GGTGTGTCGG CGTGGAGTGA TCGCACGGGT GGCGAAGAGC GCGAACGTGG CAGCTGGCGG CGCGATTGGC ATGATCCTGT ACAATCCGAC GCCGAACAGT CTCGACGCCG ATTTCCATGT CATTCCGACC GTTCACCTTC AGAACACCGA CGGCACTGCG CTGCTGACAT TCCTGACCGC CAATCCCGGC GCGACGGCGA CCTTCACCCC TGGCGCACCC GGACCAATCC AGGGTGACGT GATGGCGGGC TTCAGCTCAC GCGGCGGTCC CGGTCAGACC CTTGGTATCA GCAAGCCAGA CGTTACCGCA CCGGGTGTCA ATATTCTGGC TGGCTACACC GCCATCGAAT ATGGACAACC AGTGCCACAA TTCGCATTTC TGAGCGGCAC GTCGATGTCC 
AGCCCGCACA ACGCCGGTGC TGCGATCCTG CTCAAATGGC TCAATCCAAC CTGGACGCCT GGGCAGATCA AGTCAGCCCT GATGACCAGC GCCAGGAGTG CAGGCGTCTT CAAGGAAGAC GGGGTAACGC CGTTCACACC GTTCGACGCC GGTTCAGGGC GCATCGATCT CCGCAAGGCG TGGGATCCTG GTCTGACCTT CGACGAAACC GGTGCCGGTT ATGTGGCGTT GAAAGATGAA CTGTGGAAGG CGAACTATCC GAGCCTGTAT GTGCCGAAGA TGCCGGGTCG TATTACCGTC AGCCGAACGG TGCGCGAAGT GTCCGGGTAC GATAGTTTCT ACAAGAGTTC CATCTCGTAC CGGGCAGGGC AACCGCGCGA CTTCACGATC ACGGTGCCGC GCGAGTTCTT CGTGCCAGCT AACGGCACCT ACACCTTCGA CATCACAGTT GATGCCCGTG ATGTGCCGGT GGGTCAGGTG CGCCATGCGG TCGTCGTCTT CACCGAGCGC AACGGGTGCC GGGTGCGCTT CCCGATCACC ATCGTGCGCG GCGAACCCGA TATTCGCATG GACAAGCAGT GCAATCCGGC AACGTTGGCG TTGCGTGGAA CGACCGACTG CACCATCTCG ATCACCAACA CCACCTTCCA GAACGCTTCT GTGACGCTGA ACGATACAAT GCCGCGCCAG TTGAAGCTGG TGAGCGGGAG CGTCACCGGT GGCGCAACTG AGGTCGACAA TGGGTTGACC TTCAGCGGGA CCCTTACCGG AGCAGCACCA CCTGATGTGA GTGCAGGTCG TCTGACCTTC AACGGCTATC TGTCGCTTGC TGGACTCGGC GCGTCGCCCA ACGTCCCCCT CGGCGACGAG ACCATCGTTA ACCTTACTCT GACGCGTCCA TTCCTGTTTG GTGGGCAAAC CTACGACACT ATTGGTATGG TGAGTAACGG TTACGCCGTC GTTGGTGGCG GGACAGCGGC AGATGTGCAG TTCATCAATC GCACCTTCCC CAACACTGCC CGCCCGAATA ATACCCTGGG AGCCTTCTGG ACCGATCTCG ACGGGAGTGC TGGCGGCAGT TACTATGCGT ACCTGGTCGG CTTCGGTCCG TGCAGCAACC CGGCGAACGC CTGCTGGCTG ATCCTGGAAT GGGAGAACGC ACCGAACTGG AGCAATAACA GTCAGCGGAA CACCTTCCAG ATCTGGATCG GGTTGAACGG CGTGGAGGAT ATTACCTACT CCTACGGTCC GATTCTCTCC AGCGGCGATG GCGGTTATGT CACCGTCGGG GCCGAAAATG CCTTCGGCAA TCGCGGGGCG AACATCTACA GCAACGACGG TGTTGGTCCG ATCATCGGCA CAATTCCCAC GGCGAACTCG AACGATGTGT ACATCGAGAC GACCCCCGGC GCTCCGGGGC AGACCGTCAC CATCGGTTTC CGGGCGCAGG GCGTGCGCGT CGGGCGCTGG ACGAACTATG CGGAACTGAC CAGCCCGGCG TTCTTCGGCA CGTACATTGA TCCCTTCAGT GGTGAGGTCG TGCGACCGTA A
|
Protein sequence | MRHLARYARI TAFMLVLVVA LASVAPMRNV AAQTRPNRPT PGTPLTIEPV TVVSREDLAV NKNLAVSRDG TLASVFIKID SPSLASYMAQ NGITDINAPA AQSYLQQLNA ELDALVAQAK QRVPGLRVTH RFDLIIGGVS VVAPVGEIDK LRRLPNVVEI INDRIERIET YRTPAFIGAT TAWGKGGGSA FAGEGVIFGV LDSGVWPEHP SFSDPDPLGK PYAPPPPAPG NPGGVRACNF GSATPGDAPF TCNNKLIGSY RFMTAYDFFV GTEPYEFRSG RDDDGHGTHT ASTAAGNRGV PASDGSRVFG TISGIAPRAY VVNYKVCGEV GCFTTDSAAA VQQAIRDGVH VINFSISGGT NPYSDIASLA FLDAYNAGIL VSASAGNSGP AADTVNHREP WVATVGASTS DRSYLSTLTV QGVSGTFTAV GASSGAGIST PAPIVVNTAD PLCQNPALPG TFTGKIVVCR RGVIARVAKS ANVAAGGAIG MILYNPTPNS LDADFHVIPT VHLQNTDGTA LLTFLTANPG ATATFTPGAP GPIQGDVMAG FSSRGGPGQT LGISKPDVTA PGVNILAGYT AIEYGQPVPQ FAFLSGTSMS SPHNAGAAIL LKWLNPTWTP GQIKSALMTS ARSAGVFKED GVTPFTPFDA GSGRIDLRKA WDPGLTFDET GAGYVALKDE LWKANYPSLY VPKMPGRITV SRTVREVSGY DSFYKSSISY RAGQPRDFTI TVPREFFVPA NGTYTFDITV DARDVPVGQV RHAVVVFTER NGCRVRFPIT IVRGEPDIRM DKQCNPATLA LRGTTDCTIS ITNTTFQNAS VTLNDTMPRQ LKLVSGSVTG GATEVDNGLT FSGTLTGAAP PDVSAGRLTF NGYLSLAGLG ASPNVPLGDE TIVNLTLTRP FLFGGQTYDT IGMVSNGYAV VGGGTAADVQ FINRTFPNTA RPNNTLGAFW TDLDGSAGGS YYAYLVGFGP CSNPANACWL ILEWENAPNW SNNSQRNTFQ IWIGLNGVED ITYSYGPILS SGDGGYVTVG AENAFGNRGA NIYSNDGVGP IIGTIPTANS NDVYIETTPG APGQTVTIGF RAQGVRVGRW TNYAELTSPA FFGTYIDPFS GEVVRP