Gene CPR_2192 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_2192 
SymbolcotS 
ID4205791 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp2420600 
End bp2421601 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content27% 
IMG OID642566742 
Productspore coat protein CotS 
Protein accessionYP_699492 
Protein GI110802573 
COG category[R] General function prediction only 
COG ID[COG2334] Putative homoserine kinase type II (protein kinase fold) 
TIGRFAM ID[TIGR02906] spore coat protein, CotS family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00336173 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGAGAG AGTTTGAAAT AGAAAGACAA TTTGATATTA AAATAGAAAA ATTAAAACCT 
AGCAAGGGTG TATATTATTT AAAAAGTAAT AAGGGTGACA GATGTTTAAA AAGGATAAAC
TATGGAACTC AAAAACTTCT TTTCGTTTAT GGAGCAAAGG AGCATTTAGC TAAAAATGGA
TTTGAACATA TAGATAGATA TTTCTTAAAT ATTGAAGATG AGCCTTATGC TCTAGTAAAT
GAGGATTTAT ATACTCTTTC AAATTGGATA AAGGGAAGAG AGTGTGATTT CACTAACATA
GAAGAGGTTA AATTAGCTGC TAAAAAGTTA GCTGAATTAC ATGAAGCTAG CAAGGGATAT
GATCCACCAG AAAACTCAAA ATTAAAAAGT GATTTAGGAA GATGGCCATA TCTTATGGAA
AAGAGAGGCA AAGCCTTAGA AAAAATGAGA GGAATGGCTA GAAAGAAAAA TTTAAAAAAA
GATTTTGATA TTATTTATAT AAAAAATGTT GATTTTTATA AGGAGTTAGC AATAAGAGCC
ACAAAAATAT TAAATAATTC AAAGTATTTA AGTTTATGTG AAGAAGCAGA GGCTGAGAAA
GTATTTTGTC ATCATGATTA TACTTATCAC AATATAATAA TTGGAGATGA TAATGAAGTA
TATATAATAG ACTTTGATTA TTGTAAAAGA GAAATAAGAA CATATGACAT AGCTAACTTC
ATGAAGAAGG TTTTAAAAAG AGTTGACTGG AATATTGAAT ATGCAGAGGC CATAATAAAT
GCTTATAATA CAGTAAGTCC ATTAAGGGAA GAAGAATATG AGGTATTATA TGCATACTTG
TTATTCCCAC AAAGATATTG GAGACTTGCA AATAGATACT ACTATAATGA AGTTATGTGG
GGACAAAATA TCTTTATAAA TAAAATAAAC AACATAATTA ATGAGAAAGA AAGTTATATG
AAATTTATTG AAGAATTTAA AAGCAAATAT AATCAAGCTT AG
 
Protein sequence
MMREFEIERQ FDIKIEKLKP SKGVYYLKSN KGDRCLKRIN YGTQKLLFVY GAKEHLAKNG 
FEHIDRYFLN IEDEPYALVN EDLYTLSNWI KGRECDFTNI EEVKLAAKKL AELHEASKGY
DPPENSKLKS DLGRWPYLME KRGKALEKMR GMARKKNLKK DFDIIYIKNV DFYKELAIRA
TKILNNSKYL SLCEEAEAEK VFCHHDYTYH NIIIGDDNEV YIIDFDYCKR EIRTYDIANF
MKKVLKRVDW NIEYAEAIIN AYNTVSPLRE EEYEVLYAYL LFPQRYWRLA NRYYYNEVMW
GQNIFINKIN NIINEKESYM KFIEEFKSKY NQA