Gene CPR_1959 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_1959 
Symbol 
ID4204388 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp2163904 
End bp2165019 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content27% 
IMG OID642566509 
Producthypothetical protein 
Protein accessionYP_699268 
Protein GI110801772 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3405] Endoglucanase Y 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.987661 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAAAAAAT GGCTAATAGC TATAGGTGGT ATTTTAATAA TTGCTATTTT TATATTATTT 
TTTAATAGAG AGTATTTAAA ACCAGTACAT ATGGATGTTA ATTGGAATGA TAATTTCAAT
CCACCAGAAG AAAAAATGCT TTTTGATTTT ATAGAGAAGG ATTTAAGTAA AAGTGGATAT
GGAATATATA CAAATTATAT AGATAAAAGT TCAGAAGGAG ATATAACTAA GGGGTACTCA
GTATTATCTG AGTCAGAAGG GCTTATGATG TTATATTCGG TAAATTCTAA TAATAAAGAA
TTATTTGATG AGCATTTTGA CATAGTAAAA GAAATGAGAT TAAAAAATGG ACTTATTAGT
TGGAGGAAAG AAGGAGATAA AAATTCACCG TCCTCTGCAA CTATAGATGA ACTTAGAATA
ATAAAAGCTC TTCTTTTAGC CAGCAACAGA TGGAATAGTT TTTATTATAA ATTTTATGCT
ATAAATATTG CTAACTCTTT ACTTAAACAT GCAGAAGAAA ATAAAACTTT AGTAGATTAT
ATAGATGACT ATGGAAAAGG GAATACAACT ACTTTATGTT ATTTAGACTT GCCTACTATG
AAATTATTGA GTCAAGTAGA TAAGAAGTGG GAAGGAATTT ATGAAAAATC TAACGGTATA
ATAGAAAATG GAAGAATATC TGAAGAGGTT CCTTTATATA GAAAAGTATT TTATGAAGAA
ACTCAAAAAT ATGATGAAGA AGAAAATGTT GATTTCTTAT TATCTACAAT AGTAATTTTA
AATAGAATTG AAGCTGGAGA AAATGAGGAG TCATCTATTA AATGGATAAA AGAAAAGTTT
AAGAAAGACG GATTCTTAGT AGCTACATAC AATGGTAAAA ATGGAGATGC TACCTCACAG
ATTGAATCTC CATCAATATA CTCTAATGTA GCTTTAATAG CAAATTACAT TGGAGATAAG
GAATTATTTA ACAAGGCTAT AGATAAATTA AAATATTATC AAATAAAAAA TAAAGATAGT
GTGCTTTATG GTGGATTTGG AGATGAAAAA ACAAATAGCG TATATTCTTT TGATAATTTA
AATGCACTAC TAGCTTTTCA AAAATATAAG GATTAA
 
Protein sequence
MKKWLIAIGG ILIIAIFILF FNREYLKPVH MDVNWNDNFN PPEEKMLFDF IEKDLSKSGY 
GIYTNYIDKS SEGDITKGYS VLSESEGLMM LYSVNSNNKE LFDEHFDIVK EMRLKNGLIS
WRKEGDKNSP SSATIDELRI IKALLLASNR WNSFYYKFYA INIANSLLKH AEENKTLVDY
IDDYGKGNTT TLCYLDLPTM KLLSQVDKKW EGIYEKSNGI IENGRISEEV PLYRKVFYEE
TQKYDEEENV DFLLSTIVIL NRIEAGENEE SSIKWIKEKF KKDGFLVATY NGKNGDATSQ
IESPSIYSNV ALIANYIGDK ELFNKAIDKL KYYQIKNKDS VLYGGFGDEK TNSVYSFDNL
NALLAFQKYK D