Gene Information |
Locus tag | Cphy_0703 |
Symbol | |
ID | 5743819 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium phytofermentans ISDg |
Kingdom | Bacteria |
Replicon accession | NC_010001 |
Strand | + |
Start bp | 909941 |
End bp | 911932 |
Gene Length | 1992 bp |
Protein Length | 663 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 641291815 |
Product | spore coat protein CotH |
Protein accession | YP_001557829 |
Protein GI | 160878861 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG5337] Spore coat assembly protein |
TIGRFAM ID | |
|
|
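The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene ends in a single stop codon that is not counted in the protein length. The function names are hypothetical helpers, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_length // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(909941, 911932)  # values from the record above
    print(length, "bp")
    print(protein_length_aa(length), "aa")
```

With the values in this record, the helpers reproduce the listed Gene Length (1992 bp) and Protein Length (663 aa).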
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATAACGA GTAAGTATAT CCCAGCCATC ACAGGCGTGA TTATTTGCTT TTGCCTATTT ATTTGCGGAT TCGTTGTTTA CGCCGCAAAC GCTTTTGACA CTACGAATAT CCCTGAATAT CAAAAAAGGC TATTTGGCGA TGAAATCATA ACGCTTGACA TTCAAGTGGA CGGCAGCGAT TGGCAGAGTC TTCTTGACAA TGCTCAGGCC AAGGAATGGA TATCCGCCGA CCTTATAATC AATGGGGAAA AGTTCAGTGG CGTAGGCGTG AGGACAAAAG GTAATTCAAG CCTTTCGTCA GCTATTACGT CAGACGGTGG GGGGAGATAT AGCCTGCAAT TCAAGTTAAA TAAATATATT AAAGGGCAGA CCTATTATGG ACTGGATACA TTTTGTATAA ATAACATGAT GGGCGACGCG ACGTATATGA AGGATTATCT CTCTTATGAG ATTATGAACT ATATCGGCGT TGCAACACCA CTCACAAATT ACGCCAGTGT AACGGTCAAC GGCGAGGACT ATGGTTTTTG CGTTGCGCTT GAACGGTACG AAGAAGCGTT TTTGGACCGC GCTTATAGTA CGTCAGCCGG ACAGTTGTAC AGTGTAAAAA TGGGAATGGG GATGCGCGGT AATTTCGAGG ATATGCAGCA GGATGGTGAG AACGGATTCC CTGGTAGAGG GGAAGCTTTA AATAACGAGG GATTCCCTGA CAGGCAGCAG AACGGAAACT TTGGTGATCG TTCACAGGGC GATGGTGGTA TAAACTTCGA GGACAGGCAG CAAGGCGGTG GCATCGGAAT GGGCGGCTTC GGTGGACAAG GCGGCGGTTC GCTTGTCTAC ACCGACGATA ACATAAGTAG CTATTCCGCA ATTTTTGAAA ATGCCGTGTT TAATAATGCT TCTAAGAAGG ACAAAAAGCG CGTAATCACA GCGCTTGAAA ATCTGAACAC TGGAACTGAT TTGGAAAAAT ATTTCGATGT TGACGAGATT TTACGCTACT TCGCGGCGCA TACGGTAGTT GTCAACCTTG ATAGCTACAT TTCCAATATG GCGCAGAATT ACTATATCTA TGAGCGCGAT GGAAAGTTGA GCATTTTACC GTGGGATTAC GGCCTTGCTT TCGGAGGATT CCAGTCAGGT GGTGCGTCTA GTGTTGTGAA TTTCCCTATA GACACGCCGG TAAGTGGAGT GAGCATGGAG GACAGACCTC TTCTTAATAT GCTGTTAGAA GTAGATGAAT ATAAGGAGAG ATACCACAGC TATCTGCAGC AAATCGTAGA AGGATATTTT GAGAGCGGGC TGTTTGAAAA GAAAATTAAC GAATTGGATG TAAAAATTAG TGAGTATGTA AAAAATGATG TTTCAGCATA CTATACCTAT GATCAGTATA AGGAGTCGTT ACCAAATTTA ATTGAGCTTG GGCATTTGCG TGCAGAAAGC ATCAAGGGTC AACTTAACGG GACAATTCCA TCAACATCAA ATGGGCAAAA CGCCGACAAT TCAGCCCTGA TAGACGCGGA GGGCGTCAAC CTTTCTGCTC TAGGTTCAAT GGGAGCAGGT GGTATGGGCG GCGGAATGGG CGGGCGCCCA GAGGTTGACG GGGATTTCTC GCTGGACGGT CAAGGAGGAT TTCCGGGAGG GAATATGGTA GATATGGCGT TAATGCAGCA GGCTATGCAA ATTCTCAGGG AAGCTGGCGG TGAATTGACG GATGAAGTAA AAGATGACCT TCTGGAACTA GGTTTGTCGG AAGAACAAAT AGACATGGTT ATTAGTATGC AGAATGGATT ACCGGACGGT 
ATGAATCAAC CTGGGGCTGA CAATGGTAAT GGCGGTCGGG GCGGCAGACA GGGTGATACA AACCTGACCA ACGCATCCGT GACATCATCT CAAATTAACT CCAGTACAAT GGTTATCCTC ACCGTCGGTA TGTTGATTTT TCTGATCTAC GCAACTATCT TCATCGCCAA ACCAAGAAAG AATATGATAT AA
|
Protein sequence | MITSKYIPAI TGVIICFCLF ICGFVVYAAN AFDTTNIPEY QKRLFGDEII TLDIQVDGSD WQSLLDNAQA KEWISADLII NGEKFSGVGV RTKGNSSLSS AITSDGGGRY SLQFKLNKYI KGQTYYGLDT FCINNMMGDA TYMKDYLSYE IMNYIGVATP LTNYASVTVN GEDYGFCVAL ERYEEAFLDR AYSTSAGQLY SVKMGMGMRG NFEDMQQDGE NGFPGRGEAL NNEGFPDRQQ NGNFGDRSQG DGGINFEDRQ QGGGIGMGGF GGQGGGSLVY TDDNISSYSA IFENAVFNNA SKKDKKRVIT ALENLNTGTD LEKYFDVDEI LRYFAAHTVV VNLDSYISNM AQNYYIYERD GKLSILPWDY GLAFGGFQSG GASSVVNFPI DTPVSGVSME DRPLLNMLLE VDEYKERYHS YLQQIVEGYF ESGLFEKKIN ELDVKISEYV KNDVSAYYTY DQYKESLPNL IELGHLRAES IKGQLNGTIP STSNGQNADN SALIDAEGVN LSALGSMGAG GMGGGMGGRP EVDGDFSLDG QGGFPGGNMV DMALMQQAMQ ILREAGGELT DEVKDDLLEL GLSEEQIDMV ISMQNGLPDG MNQPGADNGN GGRGGRQGDT NLTNASVTSS QINSSTMVIL TVGMLIFLIY ATIFIAKPRK NMI
|
| |
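The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in plain Python. This is a minimal illustration, not the pipeline that produced the record. It assumes the sequence is pasted with its space-separated 10-base groups, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons). The names `translate` and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering. Table 11 shares these
# amino acid assignments with the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in the record."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence from its first base, stopping at a stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First three 10-base groups of the gene sequence in this record.
    head = "ATGATAACGA GTAAGTATAT CCCAGCCATC"
    print(translate(head))
    print(round(gc_percent(head), 1))
```

Passing the complete gene sequence from the record to `translate` and `gc_percent` should allow a comparison against the Protein sequence and GC content fields.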