Gene Information |
Locus tag | CPR_2192 |
Symbol | cotS |
ID | 4205791 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens SM101 |
Kingdom | Bacteria |
Replicon accession | NC_008262 |
Strand | - |
Start bp | 2420600 |
End bp | 2421601 |
Gene Length | 1002 bp |
Protein Length | 333 aa |
Translation table | 11 |
GC content | 27% |
IMG OID | 642566742 |
Product | spore coat protein CotS |
Protein accession | YP_699492 |
Protein GI | 110802573 |
COG category | [R] General function prediction only |
COG ID | [COG2334] Putative homoserine kinase type II (protein kinase fold) |
TIGRFAM ID | [TIGR02906] spore coat protein, CotS family |
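
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is a minimal illustration, not part of the source record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length_bp` and `protein_length_aa` are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon.
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(2420600, 2421601)
    print(length, protein_length_aa(length))  # prints "1002 333"
```

Under these assumptions the result agrees with the Gene Length (1002 bp) and Protein Length (333 aa) fields of this record.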
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.00336173 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGATGAGAG AGTTTGAAAT AGAAAGACAA TTTGATATTA AAATAGAAAA ATTAAAACCT AGCAAGGGTG TATATTATTT AAAAAGTAAT AAGGGTGACA GATGTTTAAA AAGGATAAAC TATGGAACTC AAAAACTTCT TTTCGTTTAT GGAGCAAAGG AGCATTTAGC TAAAAATGGA TTTGAACATA TAGATAGATA TTTCTTAAAT ATTGAAGATG AGCCTTATGC TCTAGTAAAT GAGGATTTAT ATACTCTTTC AAATTGGATA AAGGGAAGAG AGTGTGATTT CACTAACATA GAAGAGGTTA AATTAGCTGC TAAAAAGTTA GCTGAATTAC ATGAAGCTAG CAAGGGATAT GATCCACCAG AAAACTCAAA ATTAAAAAGT GATTTAGGAA GATGGCCATA TCTTATGGAA AAGAGAGGCA AAGCCTTAGA AAAAATGAGA GGAATGGCTA GAAAGAAAAA TTTAAAAAAA GATTTTGATA TTATTTATAT AAAAAATGTT GATTTTTATA AGGAGTTAGC AATAAGAGCC ACAAAAATAT TAAATAATTC AAAGTATTTA AGTTTATGTG AAGAAGCAGA GGCTGAGAAA GTATTTTGTC ATCATGATTA TACTTATCAC AATATAATAA TTGGAGATGA TAATGAAGTA TATATAATAG ACTTTGATTA TTGTAAAAGA GAAATAAGAA CATATGACAT AGCTAACTTC ATGAAGAAGG TTTTAAAAAG AGTTGACTGG AATATTGAAT ATGCAGAGGC CATAATAAAT GCTTATAATA CAGTAAGTCC ATTAAGGGAA GAAGAATATG AGGTATTATA TGCATACTTG TTATTCCCAC AAAGATATTG GAGACTTGCA AATAGATACT ACTATAATGA AGTTATGTGG GGACAAAATA TCTTTATAAA TAAAATAAAC AACATAATTA ATGAGAAAGA AAGTTATATG AAATTTATTG AAGAATTTAA AAGCAAATAT AATCAAGCTT AG
Protein sequence | MMREFEIERQ FDIKIEKLKP SKGVYYLKSN KGDRCLKRIN YGTQKLLFVY GAKEHLAKNG FEHIDRYFLN IEDEPYALVN EDLYTLSNWI KGRECDFTNI EEVKLAAKKL AELHEASKGY DPPENSKLKS DLGRWPYLME KRGKALEKMR GMARKKNLKK DFDIIYIKNV DFYKELAIRA TKILNNSKYL SLCEEAEAEK VFCHHDYTYH NIIIGDDNEV YIIDFDYCKR EIRTYDIANF MKKVLKRVDW NIEYAEAIIN AYNTVSPLRE EEYEVLYAYL LFPQRYWRLA NRYYYNEVMW GQNIFINKIN NIINEKESYM KFIEEFKSKY NQA
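
The gene sequence above starts with ATG and ends with TAG, so it appears to be listed in coding orientation even though the gene lies on the minus strand. On that assumption, the protein sequence can be reproduced by translating it codon by codon. The sketch below is a minimal, dependency-free illustration. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as initiators, which does not matter here because the gene starts with ATG. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
# Codon table in NCBI order (bases T, C, A, G). Amino-acid assignments
# are shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-orientation DNA sequence; '*' marks a stop codon."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence in this record.
    print(translate("ATGATGAGAG AGTTTGAAAT AGAAAGACAA"))  # prints "MMREFEIERQ"
```

Passing the full gene sequence to `translate` and dropping the trailing `*` should give the protein sequence listed above, and `gc_fraction` on the same input can be compared with the GC content field.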