Gene CPR_0574 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_0574 
Symbol 
ID4205017 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp682173 
End bp683441 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content25% 
IMG OID642565134 
Producthypothetical protein 
Protein accessionYP_697901 
Protein GI110801987 
COG category[S] Function unknown 
COG ID[COG5542] Predicted integral membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0845454 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTTTTT GGGAAAAAGA TAAAAGATAT AGATATACAT TATGGTTATG TATATTAGTT 
ATGTTTGTTA TAGGAACAGT ATGTACATTA AAATATGGTA ATTACTTTTT ACTAGGGGAT
TTAGATAAGT TAAACAACGA TGATGTAAGA TATTTACATA CAGCTAAAGT ATTAGCTGAA
CAAGGGAAGT TGGTATATCA CAATATGGAT CCTACTTTAT TTATTATGCC AGGATACCCA
ATTTTTATAG CACTAATAGT AAAAATTTTT GGAAGTGGAA GTTTAGGTAT AATAGCAATA
AGAATGTCTC AATTAGTACT TCAATGTGTT TGCTTATATA TATTATATTT TTTAGCAAAA
GAATTAGTAA ATAAAAAAAC AGCAATAATA GCATGTATTC TTACAGTATT ATATTTGCCA
GAATATGTGG CAGCAAATCT TATATTAACT GAAGTATTAT ATAAGACTTT ATATATGCTT
TTATTTTATT TTTCTATAAT TGCCATAAGG AAAAATAAAA CTAAATACTA TGTTTTTTCA
GGTATAAGTT GGGCTTTAGT ATGTTTGGTA AGACCAAATG CAGCAGCATT TCCATTATTT
ATAATAATTT TTTGGATAGT TAACAAGTAT TCAATTAAGG ATATGATAAA ATACACATCC
ATAGTATTTG TAATATTTGT AACTCTTTTT TCTCCATGGT GGATTAGAAA TTACAAATTA
ACAAATAAAT TTGTTTTATT TACAGAATCA TCAGCAAATC CTAAATTATT AGGTACATTT
ATAAGATGGG GAGCTCCTAG TTTTTATAAG GATATACCAA AAGAATATAA ATATGATGAA
TTTTTAAATG ACGAATATCT AACAGAAGAT GAACAAAATA ATTTAGCAAA TTATATGATT
AAAAGAAGTT TTCAAGAAGA ACCTTTAAAG TACACTTATT GGTACACTTT AGGTAAAACT
GAAGAGCTTT ATAAGGAAGC ATATTATTGG AAACCTATAT TTAGAGTTAA TGACACAAGG
ATGAATTTTA CACATATTTC ATATATAACT CTTGGAATAT TAGGAATTAT TGCTATGATT
AGAAGAAAGA TTAAAGGTGG AAAAATGTTA ATAGTGTTTT TACTTATAAA TACTGCCGTG
TATCTTCCTT TTATAACTTT CTCAAGATAT GGATATCCAA ATATATTTGT ATTTATAATT
GGAGCTGCAT ATACTTTGAA TGTTTTATTT TGTAAGGATG AAATACAAAG TGAAAAAAGC
CTAATTTAG
 
Protein sequence
MTFWEKDKRY RYTLWLCILV MFVIGTVCTL KYGNYFLLGD LDKLNNDDVR YLHTAKVLAE 
QGKLVYHNMD PTLFIMPGYP IFIALIVKIF GSGSLGIIAI RMSQLVLQCV CLYILYFLAK
ELVNKKTAII ACILTVLYLP EYVAANLILT EVLYKTLYML LFYFSIIAIR KNKTKYYVFS
GISWALVCLV RPNAAAFPLF IIIFWIVNKY SIKDMIKYTS IVFVIFVTLF SPWWIRNYKL
TNKFVLFTES SANPKLLGTF IRWGAPSFYK DIPKEYKYDE FLNDEYLTED EQNNLANYMI
KRSFQEEPLK YTYWYTLGKT EELYKEAYYW KPIFRVNDTR MNFTHISYIT LGILGIIAMI
RRKIKGGKML IVFLLINTAV YLPFITFSRY GYPNIFVFII GAAYTLNVLF CKDEIQSEKS
LI