Gene Rcas_0695 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_0695 
Symbol 
ID5538160 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp911671 
End bp914577 
Gene Length2907 bp 
Protein Length968 aa 
Translation table11 
GC content60% 
IMG OID640892851 
Productpeptidase M16C associated domain-containing protein 
Protein accessionYP_001430835 
Protein GI156740706 
COG category[R] General function prediction only 
COG ID[COG1026] Predicted Zn-dependent peptidases, insulinase-like 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00570537 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000127809 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGAATGTGA TTCATGGATT TGAACTGCTC CGCGAGCAGC AGATCGCCGA ACTGAACTCA 
TTGGCGCGCT GGTATCGCCA TGTCGCCACC GGCGCTGAAC TTCTTTCGTT GATCAATGAC
GATGAAAATA AGGTCTTTGG CATTACTTTC CGCACCCCTC CACCCGACTC AACCGGCGTG
GCGCACATTC TCGAACACAG CGTCCTGTGC GGCTCTGAAA AGTATCCCCT GAAAAAGCCG
TTTGTCGAAT TACTTAAAGG ATCGCTCAAA ACTTTCCTCA ACGCGATGAC CTATTCGGAT
AAAACGGTCT ATCCGGTTGC GTCCACCAAT ACGAAGGATT TCTACAATCT CGTCGATGTG
TACCTCGATG CCGTCTTCCA CCCGCGCATT TCGCCGGAGG TGTTGCAACA GGAGGGTTGG
CGCTATGAGG TGAACGAGGA TGGCTCGCTC GGCTACCGTG GGGTGGTCTT CAACGAGATG
AAGGGCGCCA ACGTATCGCC CGACCGCGTG CTCTACCTGG CAGTGCAGCG GTCGCTCTTC
CCCGGTCATG TGTACAGCGT CGATTCGGGC GGCGATCCGG CTGAGATTCC CAATCTGACC
TATGAGCAGT TCAAAGCGTT CCACGAGCGG TATTACCATC CCTCAAATGC GCTGATCTTT
TTCTATGGCG ACGATGATCC GGAAGAGCGT CTGCGCCTGC TCGATCGCGT CCTGGCGCCA
TTCGAGCGTA TCCCGGTCGA CTCGATGATC CCGCTGCAAC CGCCATTGAG TGAACCGCAA
CACGTCGAAG CGCCCTATCC GGCTGGTCCA AACAGCATCG ACAAGCATAT GGTGGCGGTC
AACTGGCTGC TTCCCAACCC GCCGGATATC GAAGAGGCGC TCGCGCTCGA CATCCTGGAG
CACGCGCTGG TCGGGACACC AGCCGCGCCG CTGCGCAAGG CGCTCATCGA CTCTGGTCTG
GGAGAAAATC TGACCGGATC GGGATTCGCC CGTCTGCGTC AGACCTACTT TACCGTCGGG
TTGAAAGGGG TGAAGGGCGA GAATATCGGC GCGACCGAGG ATGTGATCAT TGGAACACTG
GAGCGCCTGG CGCGTGACGG GATCGATTCG CAAACTATCG AAGCAGCGGT CAACACGGTC
GAGTTTCAGT TGCGCGAGAA TAATACCGGT TCCTACCCGC GTGGTTTAGC AGTCCTCATC
CGCGCACTCG ACACCTGGCT CTATGGCGAT GATCCGCTGG CGCCGCTCAT GTTCGAAGCG
CCGCTCCGCG CCATCAAGCA GCGGTTGAGC GCAGGGGAGC GCGTCTTCGA GCATATGATC
GAGGAGAAGT TGCTGCGCAA CCCTCACCGC ACAACGGTCG TGCTCGTTCC CGATCTGGAA
TTGACCAACC GCCAGAACGC TGCTGAACGC GAACGCCTGG TGGCGATTCG CGCAACGCTC
GATGAGGCGC AGATCGCGGC GATCAACGCG ACTGCCGCGC GCCTCAAACA GATCCAGGAA
ACCCCCGATC CGCCAGAGGC GCTTGCGTCG CTGCCGAGCC TGACGATTGC CGATCTCGAC
CGGACGATCA AGACCATCCC CACCGAAGAA CTGGCGATTG GCGCAACGCG CGTGTTGCTC
CACAATCTGT TCACCAACGG CATCGTGTAT GTGGACATTG GCATGAACCT GCGCGTGTTG
CCCCAGGAGT TTCTGCCGTA TGTGACGATC TTTGGACGTG CCCTTCTCGA AACCGGTACG
CAACACGAAG ATGTCGTTCA ATTGATCCAA CGGATCGGGC GCGATACCGG CGGCATCTTC
CCGCAATCGT TCACCTCGGC GATGCGCGGA CGCAGCATCG GCGCCGCCTG GTTGTTCCTG
CGCGGGAAAG CCATCGTCGA AAAGAGCGAT GCGCTGCTCG ACATTCTGCA CGACGTTGTG
TTGTCGGCGC GCCTTGACAA CCGTGAGCGC ATCCGGCAGA TCGTGCGCGA GGAGCGCGCC
TCGCGCGAAG CCAGCCTGAT CCCTGCCGGT CACACGGTGG TCAGCACGCG CCTGCGCGCG
CGCTTCAGCG AAGCCGATTG GGTCGCCGAG CAGATCGGCG GCGTCAGTTA CCTGATGTTC
CTGCGCCGGA TCGAGCGAAC CATCGATGAG GAGTGGGAAA CGGTGCGCGC CGTGCTTGAG
CACATGCGCG CGCGGCTGAT CGATCGCAGC GCACTGCTGG TGAATGTGAC GGTGGACGCT
GCCGGATGGG AGCGCTTCCG TCCGCACCTG GAAGCGTTCC TCGACCGTCT GCCGGTCGGA
ACGACGATAC CGGCGGCATG GAATCCGCAC AAAGGTGCGC CATCGGAAGG GTTGATCATT
CCCGCACATG TCAACTACGT CGCCAAAGGC GCCGACCTGT ATCGCCTGGG GTATCGGCTG
CACGGCTCGG CGCTCGTGGT GACGCGCTAC CTGATGACGA CCTGGCTGTG GGAGCAGATT
CGTGAGCAAG GAGGAGCATA CGGCGGCTTC TGCTCGTTCG ACCCGCGATC GGGCGTGTTC
AGTTACACAT CATACCGCGA TCCCAATCTG CTGCGCACGA TCGATGTGTA TGACCGTTCT
GCCGCCTTTT TGCGCCAACT CGACCTGAGC GAGAAAGAGT TGACCCGTGC CATCATCGGC
GTTATTGCCG ACCTCGACGC CTACCAACTG CCGGACGCGC GCGGCTTCAC GGCAATGGCG
CGATTTCTGG TCGGCGACGA CGATGCCTAC CGCCAGCAGG TGCGTGAGGA AGTGCTCGGC
ACAACGCCGG CCGACTTTCG TGCCTTTGCC GATGTGCTCG ACATCGTGCG CGACAACGCT
GCGCTCGTTG TAATGGGCGG CGAAGATGCC ATTACTGCCG CCAACCAGGA ACGCTCCCTG
TTCGCCGAGA TCACGCGCGT GCTGTAA
 
Protein sequence
MNVIHGFELL REQQIAELNS LARWYRHVAT GAELLSLIND DENKVFGITF RTPPPDSTGV 
AHILEHSVLC GSEKYPLKKP FVELLKGSLK TFLNAMTYSD KTVYPVASTN TKDFYNLVDV
YLDAVFHPRI SPEVLQQEGW RYEVNEDGSL GYRGVVFNEM KGANVSPDRV LYLAVQRSLF
PGHVYSVDSG GDPAEIPNLT YEQFKAFHER YYHPSNALIF FYGDDDPEER LRLLDRVLAP
FERIPVDSMI PLQPPLSEPQ HVEAPYPAGP NSIDKHMVAV NWLLPNPPDI EEALALDILE
HALVGTPAAP LRKALIDSGL GENLTGSGFA RLRQTYFTVG LKGVKGENIG ATEDVIIGTL
ERLARDGIDS QTIEAAVNTV EFQLRENNTG SYPRGLAVLI RALDTWLYGD DPLAPLMFEA
PLRAIKQRLS AGERVFEHMI EEKLLRNPHR TTVVLVPDLE LTNRQNAAER ERLVAIRATL
DEAQIAAINA TAARLKQIQE TPDPPEALAS LPSLTIADLD RTIKTIPTEE LAIGATRVLL
HNLFTNGIVY VDIGMNLRVL PQEFLPYVTI FGRALLETGT QHEDVVQLIQ RIGRDTGGIF
PQSFTSAMRG RSIGAAWLFL RGKAIVEKSD ALLDILHDVV LSARLDNRER IRQIVREERA
SREASLIPAG HTVVSTRLRA RFSEADWVAE QIGGVSYLMF LRRIERTIDE EWETVRAVLE
HMRARLIDRS ALLVNVTVDA AGWERFRPHL EAFLDRLPVG TTIPAAWNPH KGAPSEGLII
PAHVNYVAKG ADLYRLGYRL HGSALVVTRY LMTTWLWEQI REQGGAYGGF CSFDPRSGVF
SYTSYRDPNL LRTIDVYDRS AAFLRQLDLS EKELTRAIIG VIADLDAYQL PDARGFTAMA
RFLVGDDDAY RQQVREEVLG TTPADFRAFA DVLDIVRDNA ALVVMGGEDA ITAANQERSL
FAEITRVL