Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CPR_0574 |
Symbol | |
ID | 4205017 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens SM101 |
Kingdom | Bacteria |
Replicon accession | NC_008262 |
Strand | + |
Start bp | 682173 |
End bp | 683441 |
Gene Length | 1269 bp |
Protein Length | 422 aa |
Translation table | 11 |
GC content | 25% |
IMG OID | 642565134 |
Product | hypothetical protein |
Protein accession | YP_697901 |
Protein GI | 110801987 |
COG category | [S] Function unknown |
COG ID | [COG5542] Predicted integral membrane protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0845454 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTTTTT GGGAAAAAGA TAAAAGATAT AGATATACAT TATGGTTATG TATATTAGTT ATGTTTGTTA TAGGAACAGT ATGTACATTA AAATATGGTA ATTACTTTTT ACTAGGGGAT TTAGATAAGT TAAACAACGA TGATGTAAGA TATTTACATA CAGCTAAAGT ATTAGCTGAA CAAGGGAAGT TGGTATATCA CAATATGGAT CCTACTTTAT TTATTATGCC AGGATACCCA ATTTTTATAG CACTAATAGT AAAAATTTTT GGAAGTGGAA GTTTAGGTAT AATAGCAATA AGAATGTCTC AATTAGTACT TCAATGTGTT TGCTTATATA TATTATATTT TTTAGCAAAA GAATTAGTAA ATAAAAAAAC AGCAATAATA GCATGTATTC TTACAGTATT ATATTTGCCA GAATATGTGG CAGCAAATCT TATATTAACT GAAGTATTAT ATAAGACTTT ATATATGCTT TTATTTTATT TTTCTATAAT TGCCATAAGG AAAAATAAAA CTAAATACTA TGTTTTTTCA GGTATAAGTT GGGCTTTAGT ATGTTTGGTA AGACCAAATG CAGCAGCATT TCCATTATTT ATAATAATTT TTTGGATAGT TAACAAGTAT TCAATTAAGG ATATGATAAA ATACACATCC ATAGTATTTG TAATATTTGT AACTCTTTTT TCTCCATGGT GGATTAGAAA TTACAAATTA ACAAATAAAT TTGTTTTATT TACAGAATCA TCAGCAAATC CTAAATTATT AGGTACATTT ATAAGATGGG GAGCTCCTAG TTTTTATAAG GATATACCAA AAGAATATAA ATATGATGAA TTTTTAAATG ACGAATATCT AACAGAAGAT GAACAAAATA ATTTAGCAAA TTATATGATT AAAAGAAGTT TTCAAGAAGA ACCTTTAAAG TACACTTATT GGTACACTTT AGGTAAAACT GAAGAGCTTT ATAAGGAAGC ATATTATTGG AAACCTATAT TTAGAGTTAA TGACACAAGG ATGAATTTTA CACATATTTC ATATATAACT CTTGGAATAT TAGGAATTAT TGCTATGATT AGAAGAAAGA TTAAAGGTGG AAAAATGTTA ATAGTGTTTT TACTTATAAA TACTGCCGTG TATCTTCCTT TTATAACTTT CTCAAGATAT GGATATCCAA ATATATTTGT ATTTATAATT GGAGCTGCAT ATACTTTGAA TGTTTTATTT TGTAAGGATG AAATACAAAG TGAAAAAAGC CTAATTTAG
|
Protein sequence | MTFWEKDKRY RYTLWLCILV MFVIGTVCTL KYGNYFLLGD LDKLNNDDVR YLHTAKVLAE QGKLVYHNMD PTLFIMPGYP IFIALIVKIF GSGSLGIIAI RMSQLVLQCV CLYILYFLAK ELVNKKTAII ACILTVLYLP EYVAANLILT EVLYKTLYML LFYFSIIAIR KNKTKYYVFS GISWALVCLV RPNAAAFPLF IIIFWIVNKY SIKDMIKYTS IVFVIFVTLF SPWWIRNYKL TNKFVLFTES SANPKLLGTF IRWGAPSFYK DIPKEYKYDE FLNDEYLTED EQNNLANYMI KRSFQEEPLK YTYWYTLGKT EELYKEAYYW KPIFRVNDTR MNFTHISYIT LGILGIIAMI RRKIKGGKML IVFLLINTAV YLPFITFSRY GYPNIFVFII GAAYTLNVLF CKDEIQSEKS LI
|
| |