Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_0695 |
Symbol | |
ID | 5538160 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 911671 |
End bp | 914577 |
Gene Length | 2907 bp |
Protein Length | 968 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640892851 |
Product | peptidase M16C associated domain-containing protein |
Protein accession | YP_001430835 |
Protein GI | 156740706 |
COG category | [R] General function prediction only |
COG ID | [COG1026] Predicted Zn-dependent peptidases, insulinase-like |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00570537 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.000127809 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAATGTGA TTCATGGATT TGAACTGCTC CGCGAGCAGC AGATCGCCGA ACTGAACTCA TTGGCGCGCT GGTATCGCCA TGTCGCCACC GGCGCTGAAC TTCTTTCGTT GATCAATGAC GATGAAAATA AGGTCTTTGG CATTACTTTC CGCACCCCTC CACCCGACTC AACCGGCGTG GCGCACATTC TCGAACACAG CGTCCTGTGC GGCTCTGAAA AGTATCCCCT GAAAAAGCCG TTTGTCGAAT TACTTAAAGG ATCGCTCAAA ACTTTCCTCA ACGCGATGAC CTATTCGGAT AAAACGGTCT ATCCGGTTGC GTCCACCAAT ACGAAGGATT TCTACAATCT CGTCGATGTG TACCTCGATG CCGTCTTCCA CCCGCGCATT TCGCCGGAGG TGTTGCAACA GGAGGGTTGG CGCTATGAGG TGAACGAGGA TGGCTCGCTC GGCTACCGTG GGGTGGTCTT CAACGAGATG AAGGGCGCCA ACGTATCGCC CGACCGCGTG CTCTACCTGG CAGTGCAGCG GTCGCTCTTC CCCGGTCATG TGTACAGCGT CGATTCGGGC GGCGATCCGG CTGAGATTCC CAATCTGACC TATGAGCAGT TCAAAGCGTT CCACGAGCGG TATTACCATC CCTCAAATGC GCTGATCTTT TTCTATGGCG ACGATGATCC GGAAGAGCGT CTGCGCCTGC TCGATCGCGT CCTGGCGCCA TTCGAGCGTA TCCCGGTCGA CTCGATGATC CCGCTGCAAC CGCCATTGAG TGAACCGCAA CACGTCGAAG CGCCCTATCC GGCTGGTCCA AACAGCATCG ACAAGCATAT GGTGGCGGTC AACTGGCTGC TTCCCAACCC GCCGGATATC GAAGAGGCGC TCGCGCTCGA CATCCTGGAG CACGCGCTGG TCGGGACACC AGCCGCGCCG CTGCGCAAGG CGCTCATCGA CTCTGGTCTG GGAGAAAATC TGACCGGATC GGGATTCGCC CGTCTGCGTC AGACCTACTT TACCGTCGGG TTGAAAGGGG TGAAGGGCGA GAATATCGGC GCGACCGAGG ATGTGATCAT TGGAACACTG GAGCGCCTGG CGCGTGACGG GATCGATTCG CAAACTATCG AAGCAGCGGT CAACACGGTC GAGTTTCAGT TGCGCGAGAA TAATACCGGT TCCTACCCGC GTGGTTTAGC AGTCCTCATC CGCGCACTCG ACACCTGGCT CTATGGCGAT GATCCGCTGG CGCCGCTCAT GTTCGAAGCG CCGCTCCGCG CCATCAAGCA GCGGTTGAGC GCAGGGGAGC GCGTCTTCGA GCATATGATC GAGGAGAAGT TGCTGCGCAA CCCTCACCGC ACAACGGTCG TGCTCGTTCC CGATCTGGAA TTGACCAACC GCCAGAACGC TGCTGAACGC GAACGCCTGG TGGCGATTCG CGCAACGCTC GATGAGGCGC AGATCGCGGC GATCAACGCG ACTGCCGCGC GCCTCAAACA GATCCAGGAA ACCCCCGATC CGCCAGAGGC GCTTGCGTCG CTGCCGAGCC TGACGATTGC CGATCTCGAC CGGACGATCA AGACCATCCC CACCGAAGAA CTGGCGATTG GCGCAACGCG CGTGTTGCTC CACAATCTGT TCACCAACGG CATCGTGTAT GTGGACATTG GCATGAACCT GCGCGTGTTG CCCCAGGAGT TTCTGCCGTA TGTGACGATC TTTGGACGTG CCCTTCTCGA AACCGGTACG CAACACGAAG ATGTCGTTCA ATTGATCCAA CGGATCGGGC GCGATACCGG CGGCATCTTC CCGCAATCGT TCACCTCGGC GATGCGCGGA CGCAGCATCG GCGCCGCCTG GTTGTTCCTG CGCGGGAAAG CCATCGTCGA AAAGAGCGAT GCGCTGCTCG ACATTCTGCA CGACGTTGTG TTGTCGGCGC GCCTTGACAA CCGTGAGCGC ATCCGGCAGA TCGTGCGCGA GGAGCGCGCC TCGCGCGAAG CCAGCCTGAT CCCTGCCGGT CACACGGTGG TCAGCACGCG CCTGCGCGCG CGCTTCAGCG AAGCCGATTG GGTCGCCGAG CAGATCGGCG GCGTCAGTTA CCTGATGTTC CTGCGCCGGA TCGAGCGAAC CATCGATGAG GAGTGGGAAA CGGTGCGCGC CGTGCTTGAG CACATGCGCG CGCGGCTGAT CGATCGCAGC GCACTGCTGG TGAATGTGAC GGTGGACGCT GCCGGATGGG AGCGCTTCCG TCCGCACCTG GAAGCGTTCC TCGACCGTCT GCCGGTCGGA ACGACGATAC CGGCGGCATG GAATCCGCAC AAAGGTGCGC CATCGGAAGG GTTGATCATT CCCGCACATG TCAACTACGT CGCCAAAGGC GCCGACCTGT ATCGCCTGGG GTATCGGCTG CACGGCTCGG CGCTCGTGGT GACGCGCTAC CTGATGACGA CCTGGCTGTG GGAGCAGATT CGTGAGCAAG GAGGAGCATA CGGCGGCTTC TGCTCGTTCG ACCCGCGATC GGGCGTGTTC AGTTACACAT CATACCGCGA TCCCAATCTG CTGCGCACGA TCGATGTGTA TGACCGTTCT GCCGCCTTTT TGCGCCAACT CGACCTGAGC GAGAAAGAGT TGACCCGTGC CATCATCGGC GTTATTGCCG ACCTCGACGC CTACCAACTG CCGGACGCGC GCGGCTTCAC GGCAATGGCG CGATTTCTGG TCGGCGACGA CGATGCCTAC CGCCAGCAGG TGCGTGAGGA AGTGCTCGGC ACAACGCCGG CCGACTTTCG TGCCTTTGCC GATGTGCTCG ACATCGTGCG CGACAACGCT GCGCTCGTTG TAATGGGCGG CGAAGATGCC ATTACTGCCG CCAACCAGGA ACGCTCCCTG TTCGCCGAGA TCACGCGCGT GCTGTAA
|
Protein sequence | MNVIHGFELL REQQIAELNS LARWYRHVAT GAELLSLIND DENKVFGITF RTPPPDSTGV AHILEHSVLC GSEKYPLKKP FVELLKGSLK TFLNAMTYSD KTVYPVASTN TKDFYNLVDV YLDAVFHPRI SPEVLQQEGW RYEVNEDGSL GYRGVVFNEM KGANVSPDRV LYLAVQRSLF PGHVYSVDSG GDPAEIPNLT YEQFKAFHER YYHPSNALIF FYGDDDPEER LRLLDRVLAP FERIPVDSMI PLQPPLSEPQ HVEAPYPAGP NSIDKHMVAV NWLLPNPPDI EEALALDILE HALVGTPAAP LRKALIDSGL GENLTGSGFA RLRQTYFTVG LKGVKGENIG ATEDVIIGTL ERLARDGIDS QTIEAAVNTV EFQLRENNTG SYPRGLAVLI RALDTWLYGD DPLAPLMFEA PLRAIKQRLS AGERVFEHMI EEKLLRNPHR TTVVLVPDLE LTNRQNAAER ERLVAIRATL DEAQIAAINA TAARLKQIQE TPDPPEALAS LPSLTIADLD RTIKTIPTEE LAIGATRVLL HNLFTNGIVY VDIGMNLRVL PQEFLPYVTI FGRALLETGT QHEDVVQLIQ RIGRDTGGIF PQSFTSAMRG RSIGAAWLFL RGKAIVEKSD ALLDILHDVV LSARLDNRER IRQIVREERA SREASLIPAG HTVVSTRLRA RFSEADWVAE QIGGVSYLMF LRRIERTIDE EWETVRAVLE HMRARLIDRS ALLVNVTVDA AGWERFRPHL EAFLDRLPVG TTIPAAWNPH KGAPSEGLII PAHVNYVAKG ADLYRLGYRL HGSALVVTRY LMTTWLWEQI REQGGAYGGF CSFDPRSGVF SYTSYRDPNL LRTIDVYDRS AAFLRQLDLS EKELTRAIIG VIADLDAYQL PDARGFTAMA RFLVGDDDAY RQQVREEVLG TTPADFRAFA DVLDIVRDNA ALVVMGGEDA ITAANQERSL FAEITRVL
|
| |