Gene CPR_2503 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_2503 
Symbol 
ID4204014 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp2720229 
End bp2721248 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content27% 
IMG OID642567053 
Producttranscriptional regulator 
Protein accessionYP_699750 
Protein GI110801509 
COG category[K] Transcription 
COG ID[COG1609] Transcriptional regulators 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.117668 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTGTAA CAATAAATAA TATTGCAAAA GAAGCGGGAG TTTCTCTAGC TACAGTTTCT 
AGAGTTATAA ATAATTCTGG ATATGTAAAA AAGGAAACTC GTGAAAATGT TATAAAAGTT
ATAAATAAAT ATAATTATAC CCCTAGCGCT ATTGCTAGAA GTCTTTCAAA AAGTATTACT
AATACTATTG GAGTTATAGT TCCTGATATT ACAAATCCAT TCTTTGGATC AATAATAAAA
GGAATAAGTG ATGTTGCAGA AGTACATAAT CTAAATTTGA TTCTTTGTGA CTCTAATGAA
AGCATAGATA GAGAAATAAA AGCTATTAAG ACATTAAAGG AACAAAGAAT TAGAGGCATT
ATAATTTGTC CAACCTCTGT TGAAAATGAT TTAAATAGTG AGTATCTAAA AACTATTACT
AATTTAGGTA TACCAGTAAT ATTAATTGAT GGAAGTCTTA AATATCACAA TTTTAACGGT
GTTTTTGTGG ACAATATTAA AGGTTCTTAT GACGCCATAG AAGCATTAAT TAATGCTAAT
CATAAAGATA TAGCAATAAT TACTGGGCGT ATGACATCTA AACCAGCTCA GGATAGATTG
TTAGGATATG AAAAAGCATT ATTAATGAAT AATATTCCTA TAAATAATGA CTTAATTTTT
TATGGAAATT ATGAAGAAGA AAGTGGATAT GAATGTACTA AGAAAATCTT AGCTATGAAA
AATAGACCTT CAGCCATTTT TGTATGCAAT AATCTTATGA CTTTAGGTTG TTTAAAAGCT
TTAAGAGAAG CAAAACTTGA ATTATCAAAA GATATTTCAT TAATCTCCTT CGATAATATA
CCTATATTAG ATACACTAGG TATAAATATT AGTCATATAA ATGGGCCAAC TAAAGAACTT
GGAGAGATTG GTATGGATTT ATTAATAGAA TCCTTGAATA ATGACTCTAA AAAAGAATTA
AATAGTATAA CAATAACTCC TGAACTTGTT TTAAAAGGTT CAGAAAAGTT AATAAAATAA
 
Protein sequence
MSVTINNIAK EAGVSLATVS RVINNSGYVK KETRENVIKV INKYNYTPSA IARSLSKSIT 
NTIGVIVPDI TNPFFGSIIK GISDVAEVHN LNLILCDSNE SIDREIKAIK TLKEQRIRGI
IICPTSVEND LNSEYLKTIT NLGIPVILID GSLKYHNFNG VFVDNIKGSY DAIEALINAN
HKDIAIITGR MTSKPAQDRL LGYEKALLMN NIPINNDLIF YGNYEEESGY ECTKKILAMK
NRPSAIFVCN NLMTLGCLKA LREAKLELSK DISLISFDNI PILDTLGINI SHINGPTKEL
GEIGMDLLIE SLNNDSKKEL NSITITPELV LKGSEKLIK