Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CPR_1978 |
Symbol | |
ID | 4206590 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens SM101 |
Kingdom | Bacteria |
Replicon accession | NC_008262 |
Strand | + |
Start bp | 2185546 |
End bp | 2186625 |
Gene Length | 1080 bp |
Protein Length | 359 aa |
Translation table | 11 |
GC content | 25% |
IMG OID | 642566528 |
Product | hypothetical protein |
Protein accession | YP_699287 |
Protein GI | 110801820 |
COG category | [R] General function prediction only |
COG ID | [COG2334] Putative homoserine kinase type II (protein kinase fold) |
TIGRFAM ID | [TIGR02906] spore coat protein, CotS family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGAAAGC ATTCTAAAAA GTTTAATGAA GCTGATGAAA TTTTTAATGC TGTAGAGTCC ATAGTTTTAC CTATGTATAA TTTAGAAAAC TATTCTATAG AGAATATAAA ATTTAAAAAT ACAGATAAAA ATAGAGCTGT TTATAAATTA ATTGATGATA TTAATAATCC TAAAAATACT TTTTGCTTAA AAAAGGTTTA TTATGATGAA GGGACTCTCT TATTCATATA TTCAGTTATG GAATGGTTTG CTAGAAATGA AATTAAGCTT CCAAAAATGC TACCTTCAAA GTTTAATGGT AGATTTGTTA AAGCAAATAA TATGCTTTTT ATGCTCTGTC CATGGGTTAA AGGTGAAAAA TGTAACTTCG ATAACTTACA ACATATCTTA CTATCCATAG AAAATCTAGC TAAAATGCAT AATTGTTCAA GAAACTTTAA AGCAATTGAA GGTAGTTTAA TTAAAACTGG ATTTGATAGT CTCTACATAT CCACATTAAA ACACTTTAAT AAGATTCTTT CATCATTTAA TACTGCAACT AAAATGAAAC ATAAGGACAA GTTTTCATCA ATATTTTTAG ATGTTTTTGA TGAAAATATT TATCTAGCTA AAGAAGCTCT CTTAGTTTCA GGAGCTATTA ATAATAAAAA TTTAAGTAGA TCTCTTTGCC ATGGAGATTA TGTAAATAAA AATATTTTAA TTGATAATAC TGATGTTTGG GTAATTGATT TTGATAAAGC ATCCTTAAAT TATTCTATGT ATGATTTATG TTATTTCATG AGACGTTTAT TAAAAAGATC AAATACTAAT TGGGATATAG ACTTAACAAG AAAGATAATT AAAACATATA ATTCAATTGC TCCTCTTACA GAGGATGACT TCAAATATGT TTTTTCATAT CTAGCATTTC CACAAAAATA TTGGCGCTTA TCAAAGGACT ATTATAATAA CATAAAAAAA TGTAATAAAT CAATGTTTGT AGAATCTCTT AAGGAAGTTG CACTAGATAC TTATGCTCAG GTTAGATTTG TTGAAGAACT TAGAACATTA TTAAAAAAAG ATTTTCTACT TACTCTATAA
|
Protein sequence | MGKHSKKFNE ADEIFNAVES IVLPMYNLEN YSIENIKFKN TDKNRAVYKL IDDINNPKNT FCLKKVYYDE GTLLFIYSVM EWFARNEIKL PKMLPSKFNG RFVKANNMLF MLCPWVKGEK CNFDNLQHIL LSIENLAKMH NCSRNFKAIE GSLIKTGFDS LYISTLKHFN KILSSFNTAT KMKHKDKFSS IFLDVFDENI YLAKEALLVS GAINNKNLSR SLCHGDYVNK NILIDNTDVW VIDFDKASLN YSMYDLCYFM RRLLKRSNTN WDIDLTRKII KTYNSIAPLT EDDFKYVFSY LAFPQKYWRL SKDYYNNIKK CNKSMFVESL KEVALDTYAQ VRFVEELRTL LKKDFLLTL
|
| |