Gene Cphy_1336 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphy_1336 
Symbol 
ID5743723 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium phytofermentans ISDg 
KingdomBacteria 
Replicon accessionNC_010001 
Strand
Start bp1667018 
End bp1669471 
Gene Length2454 bp 
Protein Length817 aa 
Translation table11 
GC content38% 
IMG OID641292441 
Productpeptidase U32 
Protein accessionYP_001558452 
Protein GI160879484 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0826] Collagenase and related proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0126233 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAGG TAGAAATATT GGCACCTGCG GGTTCCATAG AAAGCATGAG GGCAGCAATC 
AATGCTGGCT GTGATGCTGT ATACATTGGC GGTATGAAAT TTGGTGCAAG AGCTTATGCA
AATAATCTTA CGGAAGATAT GCTCTTAGAT GCAATCGACT ATGCGCATGT GAGAGGTAAG
CAACTTTATC TAACGGTGAA TACGTTACTA AAAGAGGAAG AATTAGAGGA TTCCTTATAC
GAATATATTC GAAAGTATTA TGAACATGGG TTAGATGCAG TAATTGTACA GGACGTTGGA
GTTATGAGGT TTATTCATCG TAATTTTCCA GACCTTCCAA TCCATGCTAG TACCCAGATG
ACATTAACCA TGGCACAGGG GGCTGATGTA TTAAAGGAGT TTGGTGTAAC CCGTATGGTT
ATTTCAAGAG AGCTTGATTT AAATGAAATT AAACGAATCC GAGAGACAAC ATCTCTTGAG
ATAGAATCCT TTGTTCATGG AGCACTTTGC TATTGCTATT CTGGTCAATG TTTTATGAGT
AGCATGATTG GAGGACGAAG CGGTAACAGA GGACGTTGTG CACAACCTTG CCGTATGGAA
TATAAGGTAT CTGAACAGGG GAAGCGTGTA TCAAGCACTG AGGAGAGCTA TATCTTAAGC
CCAAAGGATA TGTGTACTCT TTCTAATGTA GCGGATTTAA TGGAATCAGG AATTGACTCC
TTCAAAATTG AAGGTCGTAT GAAACGGTAC GAGTATAGTG CAGGGGTAGT GGAAGCATAT
CGAAAAGAAA TTGATCGCTA TCTTGAGCTA GGGCCTAACA AATATAGAGA ATTTCATAAA
AACAATCCAA AAGTATTAAA AGATGAAATG CTTCGTTTAC AGGATTTATA TAATCGTGGT
GGTTTTAATG AGGGATATTA CCAATCCCAT AATGGGAAAG AAATGATGTC AATGCATCGA
CCTAATCACA GTGGCGTCTT AGTTGGTAGT GTGAAAGCGA AAAAAGGAAT ACAAGCAGAG
ATTTTACTAA AAGAAGATGT AAATGCACAA GATATCTTAG AATTTCGTAA TTCAGGTGAG
AAAACCTATG ATTTTACAGT AAAGGACGGG GTAGGGAAAG GGAGTATTCT AACTACCAAT
GTGAAACCTG GAAGTAAAAT AGAGATTGGC GATGAGGTTT ACCGTACCAA GAACGAGTCC
TTGTTATCAG AACTATCAAT GAAATACTAT GACGCAGATA AGAAAGTCTT AGCTTATGGC
AGTTTTTATG CAAAAGTGGG AGAACCAATG AATCTTACTG TTTGGGCAGA TATTTCAAGA
GATAGCTTAA CGAACAAGAC GGAGCAGGTT ACGATTACCG AATACGGTGA GATTGTAATG
GAAGCCAAAA ACCAGCCAAT GTCTGAAGAG AATATTAGGG AAAAATTAAT GAAAACCAAA
GACACGCCGT TTGAGTTTGA TAACCTTACC ATATCCTTGG AGGGAAATGT TTTTTTACCT
GTTAGTAAGC TAAATGAACT TAGAAGAACT GCACTGATAC AGTTAGAAGA AGCAATGGCA
TCTGCTTTTC GTCGTGAGGT TCTAGATCTT TCAGACAATC ATCCTAAACA GGTTGATTCA
AAAACTAAGA ATCAATTTGG ACTAATAGCT ACCATTCGAA ACAAAAGCCA GCTACCCCCA
GTACTTGATT GTCCGGAAGT ACAAATTGTA TATCTTGATC TAGCAGAGCT TTCAAGAGCA
GAATTACCGA ATTTTGTTGC TAGGTGCAAA GAGAAAGGGA AAAAAGTATT TCTTTTACTT
CCACATATTA TGAGAGCATC CACATATGAT GAATATTTAA AGAATAAAGA GTTGTGGATG
GTCGATACTA TAGATGGGTA TGTGATTAAG AACTTTGAGG AACTCTACCT TCTTCGAAAC
GCACTAGAAA CGAGCAAAGA AATCCGTTTG GATTATAATA TGTATGTCTT AAATCATGAA
GCAGTGAAGT TTTATCAAGA TCTTGGAATC AATAATTTTA CTGCATCCAT TGAACTTAAT
CAATCGGAGT TAATGAGATT GGGTTGCAGG AATTTCGATC TTTTGGTGTA TGGATATTTA
CCACTCATGG TTTCTGCTCA ATGTGTTAGA AAAAATACAA CAGAGTGTAA ACCGGGAAAT
CTAAATCAAC CGGCCATTTT GGAAGACCGT ATGCATATGA AGTTTCGTGT GAAAACCAAT
TGTAACTACT GTTATAATAC CATTTACAAT TCGAAGTGCT TATCGTTACT GTCGAATCAA
TCTGAGGTAT TAGCTCTAGC TCCAAATAAT CTTAGACTCG ATTTTACTTT TGAGGATGAG
AATGAGGTTG CCGAGGTATT AAGGCAATTT GTTCGTGTTT ATTGTTATGG AGAAACAGGA
AGTTTACCAG GTGAGGATTA TTCCAAAGGG CATTTTAAAC GGGGTATCCT ATAA
 
Protein sequence
MKKVEILAPA GSIESMRAAI NAGCDAVYIG GMKFGARAYA NNLTEDMLLD AIDYAHVRGK 
QLYLTVNTLL KEEELEDSLY EYIRKYYEHG LDAVIVQDVG VMRFIHRNFP DLPIHASTQM
TLTMAQGADV LKEFGVTRMV ISRELDLNEI KRIRETTSLE IESFVHGALC YCYSGQCFMS
SMIGGRSGNR GRCAQPCRME YKVSEQGKRV SSTEESYILS PKDMCTLSNV ADLMESGIDS
FKIEGRMKRY EYSAGVVEAY RKEIDRYLEL GPNKYREFHK NNPKVLKDEM LRLQDLYNRG
GFNEGYYQSH NGKEMMSMHR PNHSGVLVGS VKAKKGIQAE ILLKEDVNAQ DILEFRNSGE
KTYDFTVKDG VGKGSILTTN VKPGSKIEIG DEVYRTKNES LLSELSMKYY DADKKVLAYG
SFYAKVGEPM NLTVWADISR DSLTNKTEQV TITEYGEIVM EAKNQPMSEE NIREKLMKTK
DTPFEFDNLT ISLEGNVFLP VSKLNELRRT ALIQLEEAMA SAFRREVLDL SDNHPKQVDS
KTKNQFGLIA TIRNKSQLPP VLDCPEVQIV YLDLAELSRA ELPNFVARCK EKGKKVFLLL
PHIMRASTYD EYLKNKELWM VDTIDGYVIK NFEELYLLRN ALETSKEIRL DYNMYVLNHE
AVKFYQDLGI NNFTASIELN QSELMRLGCR NFDLLVYGYL PLMVSAQCVR KNTTECKPGN
LNQPAILEDR MHMKFRVKTN CNYCYNTIYN SKCLSLLSNQ SEVLALAPNN LRLDFTFEDE
NEVAEVLRQF VRVYCYGETG SLPGEDYSKG HFKRGIL