Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphy_0645 |
Symbol | |
ID | 5741694 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium phytofermentans ISDg |
Kingdom | Bacteria |
Replicon accession | NC_010001 |
Strand | + |
Start bp | 838382 |
End bp | 841360 |
Gene Length | 2979 bp |
Protein Length | 992 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 641291757 |
Product | peptidase M16C associated domain-containing protein |
Protein accession | YP_001557771 |
Protein GI | 160878803 |
COG category | [R] General function prediction only |
COG ID | [COG1026] Predicted Zn-dependent peptidases, insulinase-like |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0141412 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCAAGAT GCGTTAAACA AAGCAAAGAA CTCCTTCATA TGCCGGCGTA TCAAATAGAA TACGAAGAGG AGTTAAATGA TAGCAAGAGT TTAGGACTTG TATTTCGTCA TAAAAAATCG GGTGCTAGAA TCTGTGTGGT GTCGAATGAA GATGAGAATA AAGTATTTAC AATTGGTTTT CGAACTCCAC CAAAAAACAG TACGGGTGTA GCTCATATTA TTGAGCATAC AGTATTATGC GGATCGAAAG AGTTTCCAGC AAAAGACCCA TTTATTGAAC TTGCGAAAGG TTCTTTAAAT ACCTTTTTAA ATGCAATGAC TTACTCGGAT AAAACAATGT ATCCTGTAGC AAGTTGCAAT GAGAAAGATT TTCAGAATCT GATCCATGTC TATATGGATG CGGTATTCCA TCCTAACATT TATTATCGTA GAGAAATTTT TGAACAAGAG GGATGGCATT ACGAACTAGA GGATGTGAAT TCGGAATTAA AATATAATGG AGTTGTTTAT AATGAGATGA AGGGAGCGTT CTCTTCACCA GAACAGCAAT TATTCCGTGC AATTCAGGCT AGCTTGTTTC CTGATACTCC ATATGGAGTA GAGTCAGGTG GTGACCCTGA TTATATACCA GATTTAAGTT ACGAGGAATT TTTAGAGTTT CATAAAAAGT TTTATCATCC TTCGAACAGC TATATTTACC TGTATGGAGA TATGGATGTA GAAGAAAAGC TAAACTGGCT TGATGAAGCA TATTTAAGCA CATTTGACAC TCTTAAAGTC GATTCTGAAA TCCCAATGCA AAAAGCCTTT GATGGTCCAA AGAAGATTAT ATCATTATAT CCACTAAGCG ATAGCGAAAA CGAAGTAGAT AATACCTACT TAAGCTATAA TGCAGTAATT GGAACATCCA TTGATACTAC ACAGTGCATG GCATTTCAGA TACTAGAGGC AGCACTACTA TCTGCCCCTG GAGCACCACT GAAGCAAGCC TTAATTGATG CTGGTATTGG AAAAGACATC TTAAGTAGCT ATGAAAATGA AATCCTACAA CCAACCTTTA CGATCATTGC AAAAAATGCG AATGAGGAGC AGTTAGAGAA GTTTCTTTCT GTTATTACCT CAACCCTTGA AAAAATAGTA AAGGATGGAT TAAATGAAAA ATCACTATTA GCTGCAATCA ATAACTTAGA ATTCCGTTAT CGTGAAGCTG ACTTTGGACA ATTTCCAAAG GGACTACTTT TTGGTATCCA GATGTTTGGA AGTTGGCTTT ATGATGATAC GAAAGCATTT GATTATATGC ATGGTAACCG TGTCTTTCAG TTTTTAAAGA GCAAAATTAA TACAGGATAT TATGAAGATT TGATTAAAAA CTATTTATTA AATAATACAC ATGCAACCTA CTTGGTATTA AAACCGAAAA AGGGATTGAC TGGGGAAAAA GAAGCAAAAT TAAAAGAAAA GCTTGAGCAG TATAAAAATT CTTTATCCCT GGAAGAGAAA GAAGCAATAG TAGCTTCAAC AAATCACTTA AAAGAGTATC AAGAAGCACC ATCAACAAAA GAGGAGCTTG AAAAAATTCC ACTTCTTACG ATTGATGATA TTAAAAAGGA TGCACAGCCG CTTCATAATA AGGAATGCTC TCTTGAAAAC TTACCAGTTC TTCACCATGA AGTTTTTACT AACGGAATTG CATATATCAA ATGTATGTTT GACTTGAGTA AGGTACCAGA AGAACTTGTT CCTTATCTAA ATTTATTGGC AACTGTTCTT GGTTATATCG ATACAGAAAA CTATAGCTTT TTAGAGCTGT CGAATGAGAT TAATATCCAT ACAGGTGGTA TAAGTGCGGA ACTGATTACT TTCAATAAGA AGATGGATCC AGATACGTAT ACTCCAGTAT TTTCGATGAG TGGAAAGGTG TTATATTCAA AAATCGATAA GTTATTTGAG TTAATTAGGG AAATCCTTCA TCACTCTAAC TTAGGAGATA CCAAGCGCCT TTTTGAAATT ATTCGTGAGG TTAAGTCAAG AATTCAAATG AGAATGAATT CTGCAGGTCA TTCTGTTGCG GTTGATCGTG CATTTTCTTA TATTACACAG AGTGGATATT ATACAGAGGA AACGAAAGGT ATTCGTTATT TTAGATTCCT TGCAACATTG GAAAAAGAAT TTGAATCTAG AAAAGAGGAA ATTGTATCTT CCTTAAGAAA GTTAAGTGAA ATCATTTTTA CAAAGGATGG TATGGTGATC AGTATAACAG CTGAGCAAGA TGGTTTTGAA CAGTTAACAA AGACATTACC TGGATTTACC AACTCCTTAT CTGGTACTCT AGATACTAGT AATGGTAAAA CTATCAAGGA AACTCTAAAA GCTGCAAACT TTAATTTCCC AGTAGAGAAA TTAAATGAAG GTTTTATGTA TTCTGGTCAG GTACAATATG TTGCTCGTTG TGCGAATTTT GTAAATGCAG GGTTTAAAAC GAATGGAGCA CTAAAAGTAC TAAGAACAAT TATGTCATAT GACTATTTAT GGAATAATGT TCGTGTAAAG GGTGGGGCCT ATGGTTGTAT GTGTCAGTTT GCTGGACTGG ATGGTTCTGC TTATATGGTA TCTTACCGCG ATCCAAACCT TACTGAGACG GACGAAACTT ATCGTAAAGC TTATGAGTAT ACTGAAAATT TCACCTCAGA TGAGAGAGAT ATGACGAAGT ATATCATAGG AACGATGAGT ACTGTAGATA CACCGCTTAC CCCATTGATG AGAGGAAGCC GTTCTTTAAA TGCCTATATG AGCGGTACTA CAATGGATGA TATCAATCGC GATCGTAGTG AAATCTTAAG TACAACACAA AATGAAATTC GTATGATGGC ACCGATTGTA AAAGAAGTAT ACAACGCAGA TAACCTCTGT GTAATCGGTA ATGAAGAAAA AATTAAGAAG AATGAACAGA TGTTTCAGGA AATTAAAGCA TTATTTTAA
|
Protein sequence | MSRCVKQSKE LLHMPAYQIE YEEELNDSKS LGLVFRHKKS GARICVVSNE DENKVFTIGF RTPPKNSTGV AHIIEHTVLC GSKEFPAKDP FIELAKGSLN TFLNAMTYSD KTMYPVASCN EKDFQNLIHV YMDAVFHPNI YYRREIFEQE GWHYELEDVN SELKYNGVVY NEMKGAFSSP EQQLFRAIQA SLFPDTPYGV ESGGDPDYIP DLSYEEFLEF HKKFYHPSNS YIYLYGDMDV EEKLNWLDEA YLSTFDTLKV DSEIPMQKAF DGPKKIISLY PLSDSENEVD NTYLSYNAVI GTSIDTTQCM AFQILEAALL SAPGAPLKQA LIDAGIGKDI LSSYENEILQ PTFTIIAKNA NEEQLEKFLS VITSTLEKIV KDGLNEKSLL AAINNLEFRY READFGQFPK GLLFGIQMFG SWLYDDTKAF DYMHGNRVFQ FLKSKINTGY YEDLIKNYLL NNTHATYLVL KPKKGLTGEK EAKLKEKLEQ YKNSLSLEEK EAIVASTNHL KEYQEAPSTK EELEKIPLLT IDDIKKDAQP LHNKECSLEN LPVLHHEVFT NGIAYIKCMF DLSKVPEELV PYLNLLATVL GYIDTENYSF LELSNEINIH TGGISAELIT FNKKMDPDTY TPVFSMSGKV LYSKIDKLFE LIREILHHSN LGDTKRLFEI IREVKSRIQM RMNSAGHSVA VDRAFSYITQ SGYYTEETKG IRYFRFLATL EKEFESRKEE IVSSLRKLSE IIFTKDGMVI SITAEQDGFE QLTKTLPGFT NSLSGTLDTS NGKTIKETLK AANFNFPVEK LNEGFMYSGQ VQYVARCANF VNAGFKTNGA LKVLRTIMSY DYLWNNVRVK GGAYGCMCQF AGLDGSAYMV SYRDPNLTET DETYRKAYEY TENFTSDERD MTKYIIGTMS TVDTPLTPLM RGSRSLNAYM SGTTMDDINR DRSEILSTTQ NEIRMMAPIV KEVYNADNLC VIGNEEKIKK NEQMFQEIKA LF
|
| |