Gene CPR_2588 details


Gene Information

Locus tag: CPR_2588
Symbol:
ID: 4205969
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens SM101
Kingdom: Bacteria
Replicon accession: NC_008262
Strand:
Start bp: 2819182
End bp: 2820531
Gene Length: 1350 bp
Protein Length: 449 aa
Translation table: 11
GC content: 30%
IMG OID: 642567138
Product: nucleoside recognition domain-containing protein
Protein accession: YP_699835
Protein GI: 110803952
COG category: [S] Function unknown
COG ID: [COG3314] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
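
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state the convention); the variable names are illustrative and are not part of the record.

```python
# Values copied from the record above; the variable names are illustrative.
start_bp = 2819182
end_bp = 2820531

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases. The final codon is a stop codon,
# which does not contribute an amino acid to the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # 1350 449
```

Under that assumption, both figures agree with the listed Gene Length (1350 bp) and Protein Length (449 aa).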


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000000362431
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGAAAACTA AATATAAATC GTCTGATTTA TTAAAGTTTG TAATTCCATC TTTATTAGGA 
GTAATTTTAT TTATGGTTCC TATAAATGAT GGTGGAAATA TAACAATACC AGTAGCCTTT
TTCACAACTA AACTTAAGGA TTTAATAGGT GATTATTTAC CTACTTTATC AATGATAGTT
GTTGTAATAG CTGCTTCATT AACAATAATA ACTAAAGCAT TTAAGCCAAA ATTTATAATA
GAAAATAACT TTTTAAATTC ATTATTAAAT GTTAATTGGA TTTGGACACT TGGAAGAATC
TTAGGTGGAT TATTTATAAT ATCAGCATTT ACTGGATTTG GACCTGAAAT AATAAAGTCT
GATTTAACTG GGGCATTTAT ATTAAATGAT CTACTACCAA CACTTATAGT AGTATTCTTC
TTTGCAGGAT TATTCCTGCC GCTTCTTTTA AACTTTGGTT TATTGGAATT CTGTGGTGCC
TTACTAACTA AGATAATGAG ACCATTATTT AAACTACCTG GACGTTCATC AATAGACTGT
ATGACTTCTT GGTTAGGTGA TGGAACTATT GGGGTATTAC TTACTTCAAA ACAATATGAA
GAAGGATTTT ATACTGAGAG AGAAGCTTGT ATAGTTAGTA CAATGTTTTC AGTTGTTTCA
ATAACTTTTA GCTTTGTTGT TTTATCACAG GTTGGATTAG AGAATATGTT TATTCCATTC
TATTTAACTG TTACTTTTGC CGGAATAGTT GCCGCTATAA TTCTTCCAAG AGTTTGGCCT
CTTAGTAAAA AACCTGATGC TTATTTCAAC AATGCAGAAC CTAAGAATGT AGAAGATGTT
CCTGAAGGAT TTACTCCTTT CACTTTTGGA GTTTTTAAAG CAGTTGAAAA AGCCCAAAAT
GAAAGTAGTT TAAAAAAATT CTTTGCTGAT GGAATTAAAA ACGTACTTGA AATGTGGATA
GGAGTTTTAC CTGTAGTTAT GGCAATGGGT ACATTAGCTC TTATGATAGC TGAATATACT
CCTATATTTC AATGGCTTGG TGTTCCTTTC ATTCCTTTAT TTAAAATTTT AAGAATTCCT
GAAGCTTCAG CTGCTTCTCA AACTGTTATA GTTGGATTTG CAGATATGTT CTTACCATCA
GTTATAGCAT CAAAAACTAT TTTAAGTGAT ATGACTAAAT TTGTAGTTGC TTGTGTATCA
GTAACTCAAC TTATATACTT ATCAGAAGTT GGAAGTGTTA TTTTAGGATC AAAAATACCA
CTAAACTTAA AAGAATTATT TATGATTTTC TTAATGAGAA CACTAGTCAC TCTACCAGTT
ATAGCACTTA TAGCACACTT ATTATTCTAA
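
The gene sequence above is laid out in space-separated blocks of ten bases, so it has to be cleaned before its length or GC content can be computed. The following is a minimal sketch of that step; `clean_sequence` and `gc_percent` are hypothetical helper names, and the demonstration uses only the first line of the sequence rather than the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence above, used as a short demonstration.
first_line = "TTGAAAACTA AATATAAATC GTCTGATTTA TTAAAGTTTG TAATTCCATC TTTATTAGGA"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full gene sequence through the same two functions gives values that can be compared with the Gene Length and GC content fields of the record.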
 
Protein sequence
MKTKYKSSDL LKFVIPSLLG VILFMVPIND GGNITIPVAF FTTKLKDLIG DYLPTLSMIV 
VVIAASLTII TKAFKPKFII ENNFLNSLLN VNWIWTLGRI LGGLFIISAF TGFGPEIIKS
DLTGAFILND LLPTLIVVFF FAGLFLPLLL NFGLLEFCGA LLTKIMRPLF KLPGRSSIDC
MTSWLGDGTI GVLLTSKQYE EGFYTEREAC IVSTMFSVVS ITFSFVVLSQ VGLENMFIPF
YLTVTFAGIV AAIILPRVWP LSKKPDAYFN NAEPKNVEDV PEGFTPFTFG VFKAVEKAQN
ESSLKKFFAD GIKNVLEMWI GVLPVVMAMG TLALMIAEYT PIFQWLGVPF IPLFKILRIP
EASAASQTVI VGFADMFLPS VIASKTILSD MTKFVVACVS VTQLIYLSEV GSVILGSKIP
LNLKELFMIF LMRTLVTLPV IALIAHLLF
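
The protein sequence above begins with M although the gene sequence begins with TTG, which normally encodes leucine. Translation table 11 (the bacterial, archaeal and plant plastid code) accepts TTG as an initiation codon, and an initiation codon is read as methionine. The following is a minimal sketch of such a translation, assuming the input is a complete CDS in frame; `translate_cds` is a hypothetical helper name, and the codon table is built by hand rather than taken from a library.

```python
BASES = "TCAG"

# Amino acids for all 64 codons in TCAG order. Table 11 uses the same
# codon-to-amino-acid assignments as the standard code; "*" marks a stop.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Initiation codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(text: str) -> str:
    """Translate a complete, in-frame CDS, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for index in range(0, len(seq), 3):
        codon = seq[index:index + 3]
        if index == 0 and codon in START_CODONS:
            # The first codon is read as methionine when it is a valid start.
            protein.append("M")
            continue
        amino_acid = CODON_TABLE[codon]  # raises KeyError on ambiguous bases
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("TTGAAAACTA AATATAAATC GTCTGATTTA"))  # MKTKYKSSDL
```

The printed peptide matches the first ten residues of the protein sequence. The sketch does not handle ambiguous bases such as N, and it treats the first codon as a start codon only because the input is assumed to be a full CDS.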