Gene Information |
Locus tag | Cphy_1100 |
Symbol | |
ID | 5741935 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium phytofermentans ISDg |
Kingdom | Bacteria |
Replicon accession | NC_010001 |
Strand | + |
Start bp | 1389516 |
End bp | 1390538 |
Gene Length | 1023 bp |
Protein Length | 340 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 641292205 |
Product | peptidase M42 family protein |
Protein accession | YP_001558217 |
Protein GI | 160879249 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1363] Cellulase M and related proteins |
TIGRFAM ID | |
|
|
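The length fields in the table above are consistent with 1-based, inclusive coordinates: gene length is End bp minus Start bp plus one, and protein length is the number of codons minus the terminal stop codon. A minimal sketch of that arithmetic follows; the function names are illustrative assumptions and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon.

    Assumes the gene length is a whole number of codons.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    length = gene_length(1389516, 1390538)
    print(length, "bp")
    print(protein_length(length), "aa")
```

With the values from this record, the two functions reproduce the Gene Length (1023 bp) and Protein Length (340 aa) rows.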
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAACGA AAGAGTACAT TATTAAAACG TTAGAAACAT TAGTGAACAT TCCAAGCCCA TCTGGGTTTA CGAAAGAAGT GATGGAGTTT GTTGAAAGAG AAGCTAATAA GTATCATTAT TCCTGTGAGT ATATCAAACG AGGTGGCCTA ATTATTACAG TTCCTGGAGA AAGTAATGAT TGCCTTGGTC TATCAGCTCA TGTAGATACA CTGGGTGCTA TGGTCCGTTC CATCGGTGGT GATGGTATAA TTCGCTTCAC CTGTGTTGGG GGCTACACAA TGAATAGTAT CGAAGGTGAA TATTGCAAAA TCCATACCAG AGAAGGAAAG GTGTATACCG GAACTATTCT TTCAAAAAGC CCATCGGTTC ATTCCTATGA TGATGCAAGA ACTCTTGAAC GCAAGGATCG AAACATGGAA ATCCGTATTG ATGAGAAAGT AGAGAATAAA GAAGATGTTT TAAACCTTGG AATCAATAAT GGAGATTATA TTAGCTTTGA TGCACGTTTT GAGTATACAA AAAGCGGTTT TATCAAATCC AGACATCTGG ATGATAAGGC AAGTGTAGCT GTTTTATTAG GTTTACTGAA AGATTTGTCT GAACAAAATG TTACACCAAA GAGAACTTTA AAGATTTTAA TCAGCAACTT AGAAGAAATC GGTATGGGCG CTTCCTATAT CCCATCTGAA ATTAGTGAAT TTATTGCTGT TGACATGGGA GCCATTGGTG ATGATTTAGC TGGTAATGAG TATTCCGTAT CAATTTGTGC ATTAGATTCT TCAGGTCCAT ATGATTATGA GTTAACAACA AAACTTATGA ATCTTGCAAA AGACAAGCAA ATCCCATATG TTGTAGACAT CTTCCCTCAT TATGGTTCTG ATGCATCTGC TGCTATGCGT GGTGGCAATA ATATTAGAGC AGCGTTAATT GGACAGGGAA TCCACGCTTC TCATGGGATG GAGCGTACTC ATGAAACAGC ATTAATTGCT ACTTTAGATT TATTAAAAGT ATATGTAGGA TAA
|
Protein sequence | METKEYIIKT LETLVNIPSP SGFTKEVMEF VEREANKYHY SCEYIKRGGL IITVPGESND CLGLSAHVDT LGAMVRSIGG DGIIRFTCVG GYTMNSIEGE YCKIHTREGK VYTGTILSKS PSVHSYDDAR TLERKDRNME IRIDEKVENK EDVLNLGINN GDYISFDARF EYTKSGFIKS RHLDDKASVA VLLGLLKDLS EQNVTPKRTL KILISNLEEI GMGASYIPSE ISEFIAVDMG AIGDDLAGNE YSVSICALDS SGPYDYELTT KLMNLAKDKQ IPYVVDIFPH YGSDASAAMR GGNNIRAALI GQGIHASHGM ERTHETALIA TLDLLKVYVG
|
| |
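The protein sequence above can be derived from the gene sequence using translation table 11, the code listed in the Gene Information section. The sketch below is a hedged illustration rather than the database's own pipeline: it builds the codon table by hand, strips the spaces used to group the nucleotides into blocks of ten, and stops at the first stop codon. It relies on the fact that table 11 assigns the same amino acids as the standard code (the two differ in permitted start codons). The names `translate` and `gc_fraction` are assumptions introduced here.

```python
# Codon table in NCBI order (bases T, C, A, G). The amino-acid string is the
# one shared by the standard code and table 11; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES), AMINO
    )
}


def clean(seq: str) -> str:
    """Remove whitespace (e.g. 10-nt block separators) and upper-case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate complete codons from frame 0, stopping at the first stop."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First three 10-nt blocks of the gene sequence in this record.
    print(translate("ATGGAAACGA AAGAGTACAT TATTAAAACG"))
```

Run on the first three blocks of the gene sequence, `translate` yields the first ten residues of the protein sequence (`METKEYIIKT`). Passing the full gene sequence to `gc_fraction` gives a value that can be compared against the GC content row.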