Gene Information |
Locus tag | Cphy_1336 |
Symbol | |
ID | 5743723 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium phytofermentans ISDg |
Kingdom | Bacteria |
Replicon accession | NC_010001 |
Strand | + |
Start bp | 1667018 |
End bp | 1669471 |
Gene Length | 2454 bp |
Protein Length | 817 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 641292441 |
Product | peptidase U32 |
Protein accession | YP_001558452 |
Protein GI | 160879484 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0826] Collagenase and related proteases |
TIGRFAM ID | |
|
|
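The coordinate and length fields above agree with one another, which a short check can confirm. This is a minimal sketch, assuming 1-based inclusive coordinates (the convention under which the listed values match) and assuming the gene length includes the stop codon while the protein length does not. The function names are hypothetical and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues in an unspliced CDS: number of codons minus the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(1667018, 1669471)
print(length_bp)                  # 2454
print(protein_length(length_bp))  # 817
```

Both results match the Gene Length (2454 bp) and Protein Length (817 aa) rows of the record.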
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0126233 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAGG TAGAAATATT GGCACCTGCG GGTTCCATAG AAAGCATGAG GGCAGCAATC AATGCTGGCT GTGATGCTGT ATACATTGGC GGTATGAAAT TTGGTGCAAG AGCTTATGCA AATAATCTTA CGGAAGATAT GCTCTTAGAT GCAATCGACT ATGCGCATGT GAGAGGTAAG CAACTTTATC TAACGGTGAA TACGTTACTA AAAGAGGAAG AATTAGAGGA TTCCTTATAC GAATATATTC GAAAGTATTA TGAACATGGG TTAGATGCAG TAATTGTACA GGACGTTGGA GTTATGAGGT TTATTCATCG TAATTTTCCA GACCTTCCAA TCCATGCTAG TACCCAGATG ACATTAACCA TGGCACAGGG GGCTGATGTA TTAAAGGAGT TTGGTGTAAC CCGTATGGTT ATTTCAAGAG AGCTTGATTT AAATGAAATT AAACGAATCC GAGAGACAAC ATCTCTTGAG ATAGAATCCT TTGTTCATGG AGCACTTTGC TATTGCTATT CTGGTCAATG TTTTATGAGT AGCATGATTG GAGGACGAAG CGGTAACAGA GGACGTTGTG CACAACCTTG CCGTATGGAA TATAAGGTAT CTGAACAGGG GAAGCGTGTA TCAAGCACTG AGGAGAGCTA TATCTTAAGC CCAAAGGATA TGTGTACTCT TTCTAATGTA GCGGATTTAA TGGAATCAGG AATTGACTCC TTCAAAATTG AAGGTCGTAT GAAACGGTAC GAGTATAGTG CAGGGGTAGT GGAAGCATAT CGAAAAGAAA TTGATCGCTA TCTTGAGCTA GGGCCTAACA AATATAGAGA ATTTCATAAA AACAATCCAA AAGTATTAAA AGATGAAATG CTTCGTTTAC AGGATTTATA TAATCGTGGT GGTTTTAATG AGGGATATTA CCAATCCCAT AATGGGAAAG AAATGATGTC AATGCATCGA CCTAATCACA GTGGCGTCTT AGTTGGTAGT GTGAAAGCGA AAAAAGGAAT ACAAGCAGAG ATTTTACTAA AAGAAGATGT AAATGCACAA GATATCTTAG AATTTCGTAA TTCAGGTGAG AAAACCTATG ATTTTACAGT AAAGGACGGG GTAGGGAAAG GGAGTATTCT AACTACCAAT GTGAAACCTG GAAGTAAAAT AGAGATTGGC GATGAGGTTT ACCGTACCAA GAACGAGTCC TTGTTATCAG AACTATCAAT GAAATACTAT GACGCAGATA AGAAAGTCTT AGCTTATGGC AGTTTTTATG CAAAAGTGGG AGAACCAATG AATCTTACTG TTTGGGCAGA TATTTCAAGA GATAGCTTAA CGAACAAGAC GGAGCAGGTT ACGATTACCG AATACGGTGA GATTGTAATG GAAGCCAAAA ACCAGCCAAT GTCTGAAGAG AATATTAGGG AAAAATTAAT GAAAACCAAA GACACGCCGT TTGAGTTTGA TAACCTTACC ATATCCTTGG AGGGAAATGT TTTTTTACCT GTTAGTAAGC TAAATGAACT TAGAAGAACT GCACTGATAC AGTTAGAAGA AGCAATGGCA TCTGCTTTTC GTCGTGAGGT TCTAGATCTT TCAGACAATC ATCCTAAACA GGTTGATTCA AAAACTAAGA ATCAATTTGG ACTAATAGCT ACCATTCGAA ACAAAAGCCA GCTACCCCCA GTACTTGATT GTCCGGAAGT ACAAATTGTA TATCTTGATC TAGCAGAGCT TTCAAGAGCA GAATTACCGA ATTTTGTTGC TAGGTGCAAA GAGAAAGGGA AAAAAGTATT TCTTTTACTT 
CCACATATTA TGAGAGCATC CACATATGAT GAATATTTAA AGAATAAAGA GTTGTGGATG GTCGATACTA TAGATGGGTA TGTGATTAAG AACTTTGAGG AACTCTACCT TCTTCGAAAC GCACTAGAAA CGAGCAAAGA AATCCGTTTG GATTATAATA TGTATGTCTT AAATCATGAA GCAGTGAAGT TTTATCAAGA TCTTGGAATC AATAATTTTA CTGCATCCAT TGAACTTAAT CAATCGGAGT TAATGAGATT GGGTTGCAGG AATTTCGATC TTTTGGTGTA TGGATATTTA CCACTCATGG TTTCTGCTCA ATGTGTTAGA AAAAATACAA CAGAGTGTAA ACCGGGAAAT CTAAATCAAC CGGCCATTTT GGAAGACCGT ATGCATATGA AGTTTCGTGT GAAAACCAAT TGTAACTACT GTTATAATAC CATTTACAAT TCGAAGTGCT TATCGTTACT GTCGAATCAA TCTGAGGTAT TAGCTCTAGC TCCAAATAAT CTTAGACTCG ATTTTACTTT TGAGGATGAG AATGAGGTTG CCGAGGTATT AAGGCAATTT GTTCGTGTTT ATTGTTATGG AGAAACAGGA AGTTTACCAG GTGAGGATTA TTCCAAAGGG CATTTTAAAC GGGGTATCCT ATAA
|
Protein sequence | MKKVEILAPA GSIESMRAAI NAGCDAVYIG GMKFGARAYA NNLTEDMLLD AIDYAHVRGK QLYLTVNTLL KEEELEDSLY EYIRKYYEHG LDAVIVQDVG VMRFIHRNFP DLPIHASTQM TLTMAQGADV LKEFGVTRMV ISRELDLNEI KRIRETTSLE IESFVHGALC YCYSGQCFMS SMIGGRSGNR GRCAQPCRME YKVSEQGKRV SSTEESYILS PKDMCTLSNV ADLMESGIDS FKIEGRMKRY EYSAGVVEAY RKEIDRYLEL GPNKYREFHK NNPKVLKDEM LRLQDLYNRG GFNEGYYQSH NGKEMMSMHR PNHSGVLVGS VKAKKGIQAE ILLKEDVNAQ DILEFRNSGE KTYDFTVKDG VGKGSILTTN VKPGSKIEIG DEVYRTKNES LLSELSMKYY DADKKVLAYG SFYAKVGEPM NLTVWADISR DSLTNKTEQV TITEYGEIVM EAKNQPMSEE NIREKLMKTK DTPFEFDNLT ISLEGNVFLP VSKLNELRRT ALIQLEEAMA SAFRREVLDL SDNHPKQVDS KTKNQFGLIA TIRNKSQLPP VLDCPEVQIV YLDLAELSRA ELPNFVARCK EKGKKVFLLL PHIMRASTYD EYLKNKELWM VDTIDGYVIK NFEELYLLRN ALETSKEIRL DYNMYVLNHE AVKFYQDLGI NNFTASIELN QSELMRLGCR NFDLLVYGYL PLMVSAQCVR KNTTECKPGN LNQPAILEDR MHMKFRVKTN CNYCYNTIYN SKCLSLLSNQ SEVLALAPNN LRLDFTFEDE NEVAEVLRQF VRVYCYGETG SLPGEDYSKG HFKRGIL
|
| |
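The sequence fields above are printed in space-separated blocks of ten characters. The sketch below shows one way such a field could be cleaned and sanity-checked before use. The helper names are hypothetical, the stop-codon set is the standard one for translation table 11 (TAA, TAG, TGA), and the input is a short toy sequence rather than the full gene. How the database itself derives its GC content value is not stated in the record, so `gc_percent` is only an illustration.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(field: str) -> str:
    """Remove the spaces and line breaks used to group the sequence for display."""
    return "".join(field.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def looks_like_cds(seq: str) -> bool:
    """True if the length is a multiple of 3 and the final codon is a stop codon."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# Toy input in the same ten-character block layout as the record.
toy = clean_sequence("ATGAAAAAGG TAGAAATATT GGCATAA")
print(len(toy), looks_like_cds(toy), round(gc_percent(toy), 1))
```

Passing the full Gene sequence field through `clean_sequence` first would let the same checks run on the record itself.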