Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CPR_1959 |
Symbol | |
ID | 4204388 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens SM101 |
Kingdom | Bacteria |
Replicon accession | NC_008262 |
Strand | - |
Start bp | 2163904 |
End bp | 2165019 |
Gene Length | 1116 bp |
Protein Length | 371 aa |
Translation table | 11 |
GC content | 27% |
IMG OID | 642566509 |
Product | hypothetical protein |
Protein accession | YP_699268 |
Protein GI | 110801772 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3405] Endoglucanase Y |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.987661 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAAAAAAT GGCTAATAGC TATAGGTGGT ATTTTAATAA TTGCTATTTT TATATTATTT TTTAATAGAG AGTATTTAAA ACCAGTACAT ATGGATGTTA ATTGGAATGA TAATTTCAAT CCACCAGAAG AAAAAATGCT TTTTGATTTT ATAGAGAAGG ATTTAAGTAA AAGTGGATAT GGAATATATA CAAATTATAT AGATAAAAGT TCAGAAGGAG ATATAACTAA GGGGTACTCA GTATTATCTG AGTCAGAAGG GCTTATGATG TTATATTCGG TAAATTCTAA TAATAAAGAA TTATTTGATG AGCATTTTGA CATAGTAAAA GAAATGAGAT TAAAAAATGG ACTTATTAGT TGGAGGAAAG AAGGAGATAA AAATTCACCG TCCTCTGCAA CTATAGATGA ACTTAGAATA ATAAAAGCTC TTCTTTTAGC CAGCAACAGA TGGAATAGTT TTTATTATAA ATTTTATGCT ATAAATATTG CTAACTCTTT ACTTAAACAT GCAGAAGAAA ATAAAACTTT AGTAGATTAT ATAGATGACT ATGGAAAAGG GAATACAACT ACTTTATGTT ATTTAGACTT GCCTACTATG AAATTATTGA GTCAAGTAGA TAAGAAGTGG GAAGGAATTT ATGAAAAATC TAACGGTATA ATAGAAAATG GAAGAATATC TGAAGAGGTT CCTTTATATA GAAAAGTATT TTATGAAGAA ACTCAAAAAT ATGATGAAGA AGAAAATGTT GATTTCTTAT TATCTACAAT AGTAATTTTA AATAGAATTG AAGCTGGAGA AAATGAGGAG TCATCTATTA AATGGATAAA AGAAAAGTTT AAGAAAGACG GATTCTTAGT AGCTACATAC AATGGTAAAA ATGGAGATGC TACCTCACAG ATTGAATCTC CATCAATATA CTCTAATGTA GCTTTAATAG CAAATTACAT TGGAGATAAG GAATTATTTA ACAAGGCTAT AGATAAATTA AAATATTATC AAATAAAAAA TAAAGATAGT GTGCTTTATG GTGGATTTGG AGATGAAAAA ACAAATAGCG TATATTCTTT TGATAATTTA AATGCACTAC TAGCTTTTCA AAAATATAAG GATTAA
|
Protein sequence | MKKWLIAIGG ILIIAIFILF FNREYLKPVH MDVNWNDNFN PPEEKMLFDF IEKDLSKSGY GIYTNYIDKS SEGDITKGYS VLSESEGLMM LYSVNSNNKE LFDEHFDIVK EMRLKNGLIS WRKEGDKNSP SSATIDELRI IKALLLASNR WNSFYYKFYA INIANSLLKH AEENKTLVDY IDDYGKGNTT TLCYLDLPTM KLLSQVDKKW EGIYEKSNGI IENGRISEEV PLYRKVFYEE TQKYDEEENV DFLLSTIVIL NRIEAGENEE SSIKWIKEKF KKDGFLVATY NGKNGDATSQ IESPSIYSNV ALIANYIGDK ELFNKAIDKL KYYQIKNKDS VLYGGFGDEK TNSVYSFDNL NALLAFQKYK D
|
| |