Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RoseRS_0158 |
Symbol | |
ID | 5207091 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | - |
Start bp | 201996 |
End bp | 204902 |
Gene Length | 2907 bp |
Protein Length | 968 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640593786 |
Product | peptidase M16C associated domain-containing protein |
Protein accession | YP_001274544 |
Protein GI | 148654339 |
COG category | [R] General function prediction only |
COG ID | [COG1026] Predicted Zn-dependent peptidases, insulinase-like |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.471453 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACATCA CACACGGCTT TGAACTGCTC CGTGAGCAGC AGATATCCGA GTTGAACACC CTGGCGCGTC TGTATCGCCA TGTCGCCACC GGCGCCGAAC TTCTTTCGCT GATCAACGAT GACGAAAATA AGGTCTTCGG GATTACCTTC CGTACCCCGC CACCCGACTC GACCGGCGTG GCGCATATTC TCGAACACAG CGTCCTGTGC GGCTCCGAGA AATACCCATT GAAGAAGCCG TTCGTCGAGT TGCTCAAAGG ATCGCTCAAA ACATTCCTCA ATGCCATCAC CTTTTCGGAT AAAACTGTCT ATCCGGTTGC GTCCACAAAT ACAAAGGATT TCTACAATCT GATCGATGTC TACCTCGATG CCGTCTTTCA TCCGCGCATC ACGCCAGAGG TGTTGCAGCA GGAAGGGTGG CGCTATGAAC TGAATGAGGA CGGGTCGTTG GGATACCGCG GCGTGGTCTT CAACGAGATG AAGGGCGCCA ATGCTTCACC CGACCGGGTG CTCTATGTTG CAGTGCAACG GTCGCTGTTC CCCGGTCATA TCTACAGCGT CGACTCTGGC GGCGATCCGG CGGTTATTCC AAACCTGACC TACGAACAGT TCAGGGCGTT TCATGAGCGC TACTACCATC CTTCCAACGC CCTGATCTTC TTCTACGGCG ATGATGACCC GGAAGAACGC CTGCGCCTGC TGGAGCGCGT GCTGGCGCCT TTCGAGCGCA TTTCTGTCGA TGCGACAATC CCGTTGCAAC CGCCATTCCG CGAACCGCAA CGCCTTGAGG TTCCATATCC CGCCGGTCCG AACAGCGCCG ACAAACATAT GGTGACGGTC AACTGGCTCC TGCCCGATCC ACCCGATGTT GAAGAAGCGC TGGCGCTCGA CATCCTGGAG CATGCGCTGG TCGGCACGCC GGCTTCCCCG CTGCGCAAAG CCCTGATCGA TTCCGGGCTG GGAGAGAACC TCACCGGTTC GGGGTTTGCC CGCCTGCGTC AGACGTTCTT TACCGTGGGT CTGAAAGGGG TCAAAGGCGA ACACGTGCAC GCTGTCGAGA ACATGATTAT CGATACGCTT GGACGCCTGG TGCACGACGG GATCGATCCG CAAACGATCG AAGCGGCAGT CAACACGGTC GAGTTTCAGT TGCGCGAGAA CAACACCGGT TCGTATCCAC GCGGTCTGGT CGTGCTGTTC CGTGCGCTCG ACACCTGGCT CTACGGCGAG GATCCACTGG CGCCGTTGAT GTTCGAGGCG CCGCTACGCG CAGTCAAGCA GCGTCTGCAC AACGGAGGGC GCTTCTTCGA GCGCCTGATC GAAGAGCGGC TCCTGCGCAA TCCGCACCGC ACAACAGTCG TGCTCGTGCC TGATCTTGAA TTGACCAATC GCCAGAACGC TGCCGAGCGT GAGCGCCTCG CGGCGATCCG CGCCACCCTC GATGATGCAC AGATCGAACA GATCGCCACA ACCGCTGCGC GTCTCAAGCA GATCCAGGAG ACGCCCGATC CGCCGGAGGC GCTTGCGTTG CTCCCCAGTC TGACGATTGC CGATCTCGAC CGGAAGATCA AAACAACGCC TACCGAAGAG ATGCACATCG GTGCAACACG TGTGCTGCTG CACGACCTTT TTACCAACGG GATCGTGTAT ATCGACGTTG GCATGAACCT GCACACGCTG CCGCAGGAGT TGCTCCCATA TGTCACTATT TTCGGGCGTG CGCTCCTCGA AACCGGCACG CAGCACGACG ACATCATCCA GTTGACGCAG CGGATCGGGC GCGATACCGG CGGCATCTTT CCCCAAACGT TCACGTCCGC GATGCGTGGG CAGAGTGATG GCGCCGCCTG GCTGTTCCTG CGCGGGAAGG CAATTCTGGA GAAAAGCGAT GCGCTGCTCG ACATCCTGCA CGACGTTGTG CACTCCGCCC GTCTTGACAA CCGCGACCGC ATTCGCCAGA TTGTGCGCGA AGAACGTGCG TCGCGTGAAG CCAGCCTGAT CCCGGCTGGT CACACGGTCG TCAACACACG CCTGCGCGCA CGGTTCAACG AAGCCGACTG GGCAGCGGAA CAGATCGGCG GGGTCAGCTA CCTCCTCTTC CTGCGGCGTG TCGAGCGGGC TATCGATGAG GAATGGGATA CAGTATACAC TGTACTGGAG CGGATGCGCA CCCTGCTGGT CAATCGGAGC GCCCTGCTGG TTAACGTGAC TGTGGACGCT GCCGGTTGGG ATCGGTTCCG CCCCCGTCTC GAAGCATTTC TTGACCGGCT GCCCGCTGGC GAATCTGTGC TGGCGGCGTG GAACCCGCAG CCCGGCGCAC CATCAGAAGG GTTGCTCATT CCCGCAAACG TGAACTACGT TGCCAAAGGC GCCAGCCTGT ATCGCCTGGG GTACCGGCTG CACGGCTCGG CGCTGGTGGT GACGCGCTAC CTGATGACCA CCTGGCTATG GGAACAGATC CGCGAGCAGG GTGGCGCTTA CGGCGGCTTC TGCTCGTTCG ACCCGCGATC CGGCATGTTC AGTTACACGT CGTACCGCGA CCCCAACCTG CTGCGCACCA TCGAGGTCTA CGACCGTTCC GCCGAATTTT TGCGCCAGCT CGAATTGAGC GAGAAGGAGT TGACCCGCGC CATCATCGGC GTCATCGCCG AACTCGACGC ATACCAGCTC CCCGACGCAC GCGGTTTTAC CGCAATGGCG CGCCATATCG TCGGTGATGA TGACGCCTAT CGCCAGCAGG TGCGCGACGA GGTGCTGGGC ACGACGCCCG CCGACTTCCG TGCGTTTGCC GATGTGCTCG ACATGCTGCG CGAAAACGCT GCGCTCGTTG TGATGGGAAA TGAAGACGCC ATAACCGCCG CCAATCAGGA ACGTGCGTTG TTTGCCGCCA TCACACGCGT GCTGTAA
|
Protein sequence | MNITHGFELL REQQISELNT LARLYRHVAT GAELLSLIND DENKVFGITF RTPPPDSTGV AHILEHSVLC GSEKYPLKKP FVELLKGSLK TFLNAITFSD KTVYPVASTN TKDFYNLIDV YLDAVFHPRI TPEVLQQEGW RYELNEDGSL GYRGVVFNEM KGANASPDRV LYVAVQRSLF PGHIYSVDSG GDPAVIPNLT YEQFRAFHER YYHPSNALIF FYGDDDPEER LRLLERVLAP FERISVDATI PLQPPFREPQ RLEVPYPAGP NSADKHMVTV NWLLPDPPDV EEALALDILE HALVGTPASP LRKALIDSGL GENLTGSGFA RLRQTFFTVG LKGVKGEHVH AVENMIIDTL GRLVHDGIDP QTIEAAVNTV EFQLRENNTG SYPRGLVVLF RALDTWLYGE DPLAPLMFEA PLRAVKQRLH NGGRFFERLI EERLLRNPHR TTVVLVPDLE LTNRQNAAER ERLAAIRATL DDAQIEQIAT TAARLKQIQE TPDPPEALAL LPSLTIADLD RKIKTTPTEE MHIGATRVLL HDLFTNGIVY IDVGMNLHTL PQELLPYVTI FGRALLETGT QHDDIIQLTQ RIGRDTGGIF PQTFTSAMRG QSDGAAWLFL RGKAILEKSD ALLDILHDVV HSARLDNRDR IRQIVREERA SREASLIPAG HTVVNTRLRA RFNEADWAAE QIGGVSYLLF LRRVERAIDE EWDTVYTVLE RMRTLLVNRS ALLVNVTVDA AGWDRFRPRL EAFLDRLPAG ESVLAAWNPQ PGAPSEGLLI PANVNYVAKG ASLYRLGYRL HGSALVVTRY LMTTWLWEQI REQGGAYGGF CSFDPRSGMF SYTSYRDPNL LRTIEVYDRS AEFLRQLELS EKELTRAIIG VIAELDAYQL PDARGFTAMA RHIVGDDDAY RQQVRDEVLG TTPADFRAFA DVLDMLRENA ALVVMGNEDA ITAANQERAL FAAITRVL
|
| |