Gene RoseRS_0158 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_0158 
Symbol 
ID5207091 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp201996 
End bp204902 
Gene Length2907 bp 
Protein Length968 aa 
Translation table11 
GC content60% 
IMG OID640593786 
Productpeptidase M16C associated domain-containing protein 
Protein accessionYP_001274544 
Protein GI148654339 
COG category[R] General function prediction only 
COG ID[COG1026] Predicted Zn-dependent peptidases, insulinase-like 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.471453 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACATCA CACACGGCTT TGAACTGCTC CGTGAGCAGC AGATATCCGA GTTGAACACC 
CTGGCGCGTC TGTATCGCCA TGTCGCCACC GGCGCCGAAC TTCTTTCGCT GATCAACGAT
GACGAAAATA AGGTCTTCGG GATTACCTTC CGTACCCCGC CACCCGACTC GACCGGCGTG
GCGCATATTC TCGAACACAG CGTCCTGTGC GGCTCCGAGA AATACCCATT GAAGAAGCCG
TTCGTCGAGT TGCTCAAAGG ATCGCTCAAA ACATTCCTCA ATGCCATCAC CTTTTCGGAT
AAAACTGTCT ATCCGGTTGC GTCCACAAAT ACAAAGGATT TCTACAATCT GATCGATGTC
TACCTCGATG CCGTCTTTCA TCCGCGCATC ACGCCAGAGG TGTTGCAGCA GGAAGGGTGG
CGCTATGAAC TGAATGAGGA CGGGTCGTTG GGATACCGCG GCGTGGTCTT CAACGAGATG
AAGGGCGCCA ATGCTTCACC CGACCGGGTG CTCTATGTTG CAGTGCAACG GTCGCTGTTC
CCCGGTCATA TCTACAGCGT CGACTCTGGC GGCGATCCGG CGGTTATTCC AAACCTGACC
TACGAACAGT TCAGGGCGTT TCATGAGCGC TACTACCATC CTTCCAACGC CCTGATCTTC
TTCTACGGCG ATGATGACCC GGAAGAACGC CTGCGCCTGC TGGAGCGCGT GCTGGCGCCT
TTCGAGCGCA TTTCTGTCGA TGCGACAATC CCGTTGCAAC CGCCATTCCG CGAACCGCAA
CGCCTTGAGG TTCCATATCC CGCCGGTCCG AACAGCGCCG ACAAACATAT GGTGACGGTC
AACTGGCTCC TGCCCGATCC ACCCGATGTT GAAGAAGCGC TGGCGCTCGA CATCCTGGAG
CATGCGCTGG TCGGCACGCC GGCTTCCCCG CTGCGCAAAG CCCTGATCGA TTCCGGGCTG
GGAGAGAACC TCACCGGTTC GGGGTTTGCC CGCCTGCGTC AGACGTTCTT TACCGTGGGT
CTGAAAGGGG TCAAAGGCGA ACACGTGCAC GCTGTCGAGA ACATGATTAT CGATACGCTT
GGACGCCTGG TGCACGACGG GATCGATCCG CAAACGATCG AAGCGGCAGT CAACACGGTC
GAGTTTCAGT TGCGCGAGAA CAACACCGGT TCGTATCCAC GCGGTCTGGT CGTGCTGTTC
CGTGCGCTCG ACACCTGGCT CTACGGCGAG GATCCACTGG CGCCGTTGAT GTTCGAGGCG
CCGCTACGCG CAGTCAAGCA GCGTCTGCAC AACGGAGGGC GCTTCTTCGA GCGCCTGATC
GAAGAGCGGC TCCTGCGCAA TCCGCACCGC ACAACAGTCG TGCTCGTGCC TGATCTTGAA
TTGACCAATC GCCAGAACGC TGCCGAGCGT GAGCGCCTCG CGGCGATCCG CGCCACCCTC
GATGATGCAC AGATCGAACA GATCGCCACA ACCGCTGCGC GTCTCAAGCA GATCCAGGAG
ACGCCCGATC CGCCGGAGGC GCTTGCGTTG CTCCCCAGTC TGACGATTGC CGATCTCGAC
CGGAAGATCA AAACAACGCC TACCGAAGAG ATGCACATCG GTGCAACACG TGTGCTGCTG
CACGACCTTT TTACCAACGG GATCGTGTAT ATCGACGTTG GCATGAACCT GCACACGCTG
CCGCAGGAGT TGCTCCCATA TGTCACTATT TTCGGGCGTG CGCTCCTCGA AACCGGCACG
CAGCACGACG ACATCATCCA GTTGACGCAG CGGATCGGGC GCGATACCGG CGGCATCTTT
CCCCAAACGT TCACGTCCGC GATGCGTGGG CAGAGTGATG GCGCCGCCTG GCTGTTCCTG
CGCGGGAAGG CAATTCTGGA GAAAAGCGAT GCGCTGCTCG ACATCCTGCA CGACGTTGTG
CACTCCGCCC GTCTTGACAA CCGCGACCGC ATTCGCCAGA TTGTGCGCGA AGAACGTGCG
TCGCGTGAAG CCAGCCTGAT CCCGGCTGGT CACACGGTCG TCAACACACG CCTGCGCGCA
CGGTTCAACG AAGCCGACTG GGCAGCGGAA CAGATCGGCG GGGTCAGCTA CCTCCTCTTC
CTGCGGCGTG TCGAGCGGGC TATCGATGAG GAATGGGATA CAGTATACAC TGTACTGGAG
CGGATGCGCA CCCTGCTGGT CAATCGGAGC GCCCTGCTGG TTAACGTGAC TGTGGACGCT
GCCGGTTGGG ATCGGTTCCG CCCCCGTCTC GAAGCATTTC TTGACCGGCT GCCCGCTGGC
GAATCTGTGC TGGCGGCGTG GAACCCGCAG CCCGGCGCAC CATCAGAAGG GTTGCTCATT
CCCGCAAACG TGAACTACGT TGCCAAAGGC GCCAGCCTGT ATCGCCTGGG GTACCGGCTG
CACGGCTCGG CGCTGGTGGT GACGCGCTAC CTGATGACCA CCTGGCTATG GGAACAGATC
CGCGAGCAGG GTGGCGCTTA CGGCGGCTTC TGCTCGTTCG ACCCGCGATC CGGCATGTTC
AGTTACACGT CGTACCGCGA CCCCAACCTG CTGCGCACCA TCGAGGTCTA CGACCGTTCC
GCCGAATTTT TGCGCCAGCT CGAATTGAGC GAGAAGGAGT TGACCCGCGC CATCATCGGC
GTCATCGCCG AACTCGACGC ATACCAGCTC CCCGACGCAC GCGGTTTTAC CGCAATGGCG
CGCCATATCG TCGGTGATGA TGACGCCTAT CGCCAGCAGG TGCGCGACGA GGTGCTGGGC
ACGACGCCCG CCGACTTCCG TGCGTTTGCC GATGTGCTCG ACATGCTGCG CGAAAACGCT
GCGCTCGTTG TGATGGGAAA TGAAGACGCC ATAACCGCCG CCAATCAGGA ACGTGCGTTG
TTTGCCGCCA TCACACGCGT GCTGTAA
 
Protein sequence
MNITHGFELL REQQISELNT LARLYRHVAT GAELLSLIND DENKVFGITF RTPPPDSTGV 
AHILEHSVLC GSEKYPLKKP FVELLKGSLK TFLNAITFSD KTVYPVASTN TKDFYNLIDV
YLDAVFHPRI TPEVLQQEGW RYELNEDGSL GYRGVVFNEM KGANASPDRV LYVAVQRSLF
PGHIYSVDSG GDPAVIPNLT YEQFRAFHER YYHPSNALIF FYGDDDPEER LRLLERVLAP
FERISVDATI PLQPPFREPQ RLEVPYPAGP NSADKHMVTV NWLLPDPPDV EEALALDILE
HALVGTPASP LRKALIDSGL GENLTGSGFA RLRQTFFTVG LKGVKGEHVH AVENMIIDTL
GRLVHDGIDP QTIEAAVNTV EFQLRENNTG SYPRGLVVLF RALDTWLYGE DPLAPLMFEA
PLRAVKQRLH NGGRFFERLI EERLLRNPHR TTVVLVPDLE LTNRQNAAER ERLAAIRATL
DDAQIEQIAT TAARLKQIQE TPDPPEALAL LPSLTIADLD RKIKTTPTEE MHIGATRVLL
HDLFTNGIVY IDVGMNLHTL PQELLPYVTI FGRALLETGT QHDDIIQLTQ RIGRDTGGIF
PQTFTSAMRG QSDGAAWLFL RGKAILEKSD ALLDILHDVV HSARLDNRDR IRQIVREERA
SREASLIPAG HTVVNTRLRA RFNEADWAAE QIGGVSYLLF LRRVERAIDE EWDTVYTVLE
RMRTLLVNRS ALLVNVTVDA AGWDRFRPRL EAFLDRLPAG ESVLAAWNPQ PGAPSEGLLI
PANVNYVAKG ASLYRLGYRL HGSALVVTRY LMTTWLWEQI REQGGAYGGF CSFDPRSGMF
SYTSYRDPNL LRTIEVYDRS AEFLRQLELS EKELTRAIIG VIAELDAYQL PDARGFTAMA
RHIVGDDDAY RQQVRDEVLG TTPADFRAFA DVLDMLRENA ALVVMGNEDA ITAANQERAL
FAAITRVL