Gene Cphy_0645 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphy_0645 
Symbol 
ID5741694 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium phytofermentans ISDg 
KingdomBacteria 
Replicon accessionNC_010001 
Strand
Start bp838382 
End bp841360 
Gene Length2979 bp 
Protein Length992 aa 
Translation table11 
GC content35% 
IMG OID641291757 
Productpeptidase M16C associated domain-containing protein 
Protein accessionYP_001557771 
Protein GI160878803 
COG category[R] General function prediction only 
COG ID[COG1026] Predicted Zn-dependent peptidases, insulinase-like 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0141412 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCAAGAT GCGTTAAACA AAGCAAAGAA CTCCTTCATA TGCCGGCGTA TCAAATAGAA 
TACGAAGAGG AGTTAAATGA TAGCAAGAGT TTAGGACTTG TATTTCGTCA TAAAAAATCG
GGTGCTAGAA TCTGTGTGGT GTCGAATGAA GATGAGAATA AAGTATTTAC AATTGGTTTT
CGAACTCCAC CAAAAAACAG TACGGGTGTA GCTCATATTA TTGAGCATAC AGTATTATGC
GGATCGAAAG AGTTTCCAGC AAAAGACCCA TTTATTGAAC TTGCGAAAGG TTCTTTAAAT
ACCTTTTTAA ATGCAATGAC TTACTCGGAT AAAACAATGT ATCCTGTAGC AAGTTGCAAT
GAGAAAGATT TTCAGAATCT GATCCATGTC TATATGGATG CGGTATTCCA TCCTAACATT
TATTATCGTA GAGAAATTTT TGAACAAGAG GGATGGCATT ACGAACTAGA GGATGTGAAT
TCGGAATTAA AATATAATGG AGTTGTTTAT AATGAGATGA AGGGAGCGTT CTCTTCACCA
GAACAGCAAT TATTCCGTGC AATTCAGGCT AGCTTGTTTC CTGATACTCC ATATGGAGTA
GAGTCAGGTG GTGACCCTGA TTATATACCA GATTTAAGTT ACGAGGAATT TTTAGAGTTT
CATAAAAAGT TTTATCATCC TTCGAACAGC TATATTTACC TGTATGGAGA TATGGATGTA
GAAGAAAAGC TAAACTGGCT TGATGAAGCA TATTTAAGCA CATTTGACAC TCTTAAAGTC
GATTCTGAAA TCCCAATGCA AAAAGCCTTT GATGGTCCAA AGAAGATTAT ATCATTATAT
CCACTAAGCG ATAGCGAAAA CGAAGTAGAT AATACCTACT TAAGCTATAA TGCAGTAATT
GGAACATCCA TTGATACTAC ACAGTGCATG GCATTTCAGA TACTAGAGGC AGCACTACTA
TCTGCCCCTG GAGCACCACT GAAGCAAGCC TTAATTGATG CTGGTATTGG AAAAGACATC
TTAAGTAGCT ATGAAAATGA AATCCTACAA CCAACCTTTA CGATCATTGC AAAAAATGCG
AATGAGGAGC AGTTAGAGAA GTTTCTTTCT GTTATTACCT CAACCCTTGA AAAAATAGTA
AAGGATGGAT TAAATGAAAA ATCACTATTA GCTGCAATCA ATAACTTAGA ATTCCGTTAT
CGTGAAGCTG ACTTTGGACA ATTTCCAAAG GGACTACTTT TTGGTATCCA GATGTTTGGA
AGTTGGCTTT ATGATGATAC GAAAGCATTT GATTATATGC ATGGTAACCG TGTCTTTCAG
TTTTTAAAGA GCAAAATTAA TACAGGATAT TATGAAGATT TGATTAAAAA CTATTTATTA
AATAATACAC ATGCAACCTA CTTGGTATTA AAACCGAAAA AGGGATTGAC TGGGGAAAAA
GAAGCAAAAT TAAAAGAAAA GCTTGAGCAG TATAAAAATT CTTTATCCCT GGAAGAGAAA
GAAGCAATAG TAGCTTCAAC AAATCACTTA AAAGAGTATC AAGAAGCACC ATCAACAAAA
GAGGAGCTTG AAAAAATTCC ACTTCTTACG ATTGATGATA TTAAAAAGGA TGCACAGCCG
CTTCATAATA AGGAATGCTC TCTTGAAAAC TTACCAGTTC TTCACCATGA AGTTTTTACT
AACGGAATTG CATATATCAA ATGTATGTTT GACTTGAGTA AGGTACCAGA AGAACTTGTT
CCTTATCTAA ATTTATTGGC AACTGTTCTT GGTTATATCG ATACAGAAAA CTATAGCTTT
TTAGAGCTGT CGAATGAGAT TAATATCCAT ACAGGTGGTA TAAGTGCGGA ACTGATTACT
TTCAATAAGA AGATGGATCC AGATACGTAT ACTCCAGTAT TTTCGATGAG TGGAAAGGTG
TTATATTCAA AAATCGATAA GTTATTTGAG TTAATTAGGG AAATCCTTCA TCACTCTAAC
TTAGGAGATA CCAAGCGCCT TTTTGAAATT ATTCGTGAGG TTAAGTCAAG AATTCAAATG
AGAATGAATT CTGCAGGTCA TTCTGTTGCG GTTGATCGTG CATTTTCTTA TATTACACAG
AGTGGATATT ATACAGAGGA AACGAAAGGT ATTCGTTATT TTAGATTCCT TGCAACATTG
GAAAAAGAAT TTGAATCTAG AAAAGAGGAA ATTGTATCTT CCTTAAGAAA GTTAAGTGAA
ATCATTTTTA CAAAGGATGG TATGGTGATC AGTATAACAG CTGAGCAAGA TGGTTTTGAA
CAGTTAACAA AGACATTACC TGGATTTACC AACTCCTTAT CTGGTACTCT AGATACTAGT
AATGGTAAAA CTATCAAGGA AACTCTAAAA GCTGCAAACT TTAATTTCCC AGTAGAGAAA
TTAAATGAAG GTTTTATGTA TTCTGGTCAG GTACAATATG TTGCTCGTTG TGCGAATTTT
GTAAATGCAG GGTTTAAAAC GAATGGAGCA CTAAAAGTAC TAAGAACAAT TATGTCATAT
GACTATTTAT GGAATAATGT TCGTGTAAAG GGTGGGGCCT ATGGTTGTAT GTGTCAGTTT
GCTGGACTGG ATGGTTCTGC TTATATGGTA TCTTACCGCG ATCCAAACCT TACTGAGACG
GACGAAACTT ATCGTAAAGC TTATGAGTAT ACTGAAAATT TCACCTCAGA TGAGAGAGAT
ATGACGAAGT ATATCATAGG AACGATGAGT ACTGTAGATA CACCGCTTAC CCCATTGATG
AGAGGAAGCC GTTCTTTAAA TGCCTATATG AGCGGTACTA CAATGGATGA TATCAATCGC
GATCGTAGTG AAATCTTAAG TACAACACAA AATGAAATTC GTATGATGGC ACCGATTGTA
AAAGAAGTAT ACAACGCAGA TAACCTCTGT GTAATCGGTA ATGAAGAAAA AATTAAGAAG
AATGAACAGA TGTTTCAGGA AATTAAAGCA TTATTTTAA
 
Protein sequence
MSRCVKQSKE LLHMPAYQIE YEEELNDSKS LGLVFRHKKS GARICVVSNE DENKVFTIGF 
RTPPKNSTGV AHIIEHTVLC GSKEFPAKDP FIELAKGSLN TFLNAMTYSD KTMYPVASCN
EKDFQNLIHV YMDAVFHPNI YYRREIFEQE GWHYELEDVN SELKYNGVVY NEMKGAFSSP
EQQLFRAIQA SLFPDTPYGV ESGGDPDYIP DLSYEEFLEF HKKFYHPSNS YIYLYGDMDV
EEKLNWLDEA YLSTFDTLKV DSEIPMQKAF DGPKKIISLY PLSDSENEVD NTYLSYNAVI
GTSIDTTQCM AFQILEAALL SAPGAPLKQA LIDAGIGKDI LSSYENEILQ PTFTIIAKNA
NEEQLEKFLS VITSTLEKIV KDGLNEKSLL AAINNLEFRY READFGQFPK GLLFGIQMFG
SWLYDDTKAF DYMHGNRVFQ FLKSKINTGY YEDLIKNYLL NNTHATYLVL KPKKGLTGEK
EAKLKEKLEQ YKNSLSLEEK EAIVASTNHL KEYQEAPSTK EELEKIPLLT IDDIKKDAQP
LHNKECSLEN LPVLHHEVFT NGIAYIKCMF DLSKVPEELV PYLNLLATVL GYIDTENYSF
LELSNEINIH TGGISAELIT FNKKMDPDTY TPVFSMSGKV LYSKIDKLFE LIREILHHSN
LGDTKRLFEI IREVKSRIQM RMNSAGHSVA VDRAFSYITQ SGYYTEETKG IRYFRFLATL
EKEFESRKEE IVSSLRKLSE IIFTKDGMVI SITAEQDGFE QLTKTLPGFT NSLSGTLDTS
NGKTIKETLK AANFNFPVEK LNEGFMYSGQ VQYVARCANF VNAGFKTNGA LKVLRTIMSY
DYLWNNVRVK GGAYGCMCQF AGLDGSAYMV SYRDPNLTET DETYRKAYEY TENFTSDERD
MTKYIIGTMS TVDTPLTPLM RGSRSLNAYM SGTTMDDINR DRSEILSTTQ NEIRMMAPIV
KEVYNADNLC VIGNEEKIKK NEQMFQEIKA LF