Gene CPR_2603 details

Gene Information

Locus tag: CPR_2603
Symbol:
ID: 4205810
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens SM101
Kingdom: Bacteria
Replicon accession: NC_008262
Strand:
Start bp: 2837508
End bp: 2838944
Gene Length: 1437 bp
Protein Length: 478 aa
Translation table: 11
GC content: 30%
IMG OID: 642567153
Product: 6-phospho-beta-glucosidase bgla
Protein accession: YP_699850
Protein GI: 110802745
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase
TIGRFAM ID:
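
The coordinates and lengths in this record agree with each other, assuming the start and end positions are 1-based and inclusive and that the final codon is an untranslated stop codon. A minimal Python sketch of that check (the variable names are illustrative and not part of the record):

```python
# Values copied from the record.
start_bp = 2837508
end_bp = 2838944

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # prints: 1437 478
```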


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATATATG AAAAATTAGA TAACTTTAGA AGTGATTTCC TATGGGGATC AGCATCAGCA 
GCATATCAAG TTGAAGGTGG ATGGGATCAA GATGGAAAAG GAAAAAGTAT TTGGGATATA
TATACTAAAA AGGAAGGGAC AACCTATAAA AATACTAATG GGGATATAGC AGTAGATCAT
TATAACAGAT ATAAAGAAGA TGTATCTTTA ATGGCAGAAA TGGGATTAAA AGCTTATAGA
TTTTCAATTG CATGGACTCG TATATATCCT AATGGTAGAG GAAAGGTAAA TGAAGAAGGA
CTTAAGTTTT ATGAAAATTT AATTGATGAG CTAATAAAAA ATGATATAAC TCCAATAGTT
ACATTATATC ACTGGGATTT ACCACAGCAT CTACAGGATT TATATGGTGG ATGGGAATCA
AGGGAAATAA TAAACGATTT CAACAATTAT TGTGTAACTT TATTTAAGAG ATTTGGTAAT
AAGGTTAAAT ATTGGGTTAC GTTAAATGAA CAAAATGTAT TTTTAACATT AGGATATTTA
ACAGCATTAC ATCCACCAGG AGTTAAGGAT CAAAAGAGAA TGTTACAGGC AAATCATATT
GCAAATTTAG CAAATGCAAA GGTTATTGAA TCTTTTAGAA AGTATGTTTC AAATGGAATG
ATTGGTCCAA GCTTTGCATT CAATCCTAAT TATGCTTATA GTTGTAATCC ACAAGATGTG
TTAGCTGCAG AAAATGCAGA AGATTTAAAT TGTAATTGGT GGCTTGATGT TTATTGTAAG
GGTGTATATC CAACGTTTGC ATTAAGATAT TATGAAAGAT TAGGAATAGC ACCAATAATT
GAGGATGGGG ATCTTGAATT ATTAGCAAGA GTAAAACCAG ACTTTATTGG AATTAATTAT
TATCAAACAA CAACAGTTGC AATGAATTCA TTAGATGGGG TTGGAGCTTC TGAAGGAATG
AATAATACTG GTAAAAAGGG AACTACAAAG GAAAGTGGAA TACCAGGTGT ATATAAAAAT
GTAAAAAATC CTTACTTGGA AACAACTAAT TGGGATTGGA ATATAGATGC AACAGGGCTG
AGAACTGGTT TAAGAAGATT AACAAGTAGA TATGGATTGC CTATATTAAT TACAGAAAAT
GGCCTTGGTG AATTTGATAA GCTAGAAGAT AATATAGTAA ATGATGATTA TAGAATTAAA
TATTTAAAAG AACATATTAT TGCATGTAAA GAAGCAATTA CTGATGGTGT TGAGCTTTTA
GGATATTGTA CTTGGTCATT TACAGATTTG TTAAGCTGGC TTAATGGATA TCAAAAACGT
TATGGATTTG TATATATTGA TAGAGATGAA AATGATGAAA AAGATTTAAG AAGAATTAAG
AAGAAAAGTT TCTATTGGTA TAAGGATGTA ATAAGTAGTA ATGGTGAAAA TTTATAA
 
Protein sequence
MIYEKLDNFR SDFLWGSASA AYQVEGGWDQ DGKGKSIWDI YTKKEGTTYK NTNGDIAVDH 
YNRYKEDVSL MAEMGLKAYR FSIAWTRIYP NGRGKVNEEG LKFYENLIDE LIKNDITPIV
TLYHWDLPQH LQDLYGGWES REIINDFNNY CVTLFKRFGN KVKYWVTLNE QNVFLTLGYL
TALHPPGVKD QKRMLQANHI ANLANAKVIE SFRKYVSNGM IGPSFAFNPN YAYSCNPQDV
LAAENAEDLN CNWWLDVYCK GVYPTFALRY YERLGIAPII EDGDLELLAR VKPDFIGINY
YQTTTVAMNS LDGVGASEGM NNTGKKGTTK ESGIPGVYKN VKNPYLETTN WDWNIDATGL
RTGLRRLTSR YGLPILITEN GLGEFDKLED NIVNDDYRIK YLKEHIIACK EAITDGVELL
GYCTWSFTDL LSWLNGYQKR YGFVYIDRDE NDEKDLRRIK KKSFYWYKDV ISSNGENL
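
The protein sequence corresponds to a translation of the gene sequence under translation table 11, which assigns the same amino acids to codons as the standard genetic code. The sketch below illustrates this on the first 30 bases of the gene only. The helper names `translate` and `gc_fraction` are hypothetical, and the GC value it computes applies to this short prefix, not to the whole gene.

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES),
        AMINO_ACIDS,
    )
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Return the fraction of bases that are G or C."""
    dna = dna.replace(" ", "").upper()
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base groups of the gene sequence, as printed in the record.
prefix = "ATGATATATG AAAAATTAGA TAACTTTAGA"
print(translate(prefix))  # prints: MIYEKLDNFR
print(gc_fraction(prefix))
```

The translated prefix matches the first ten residues of the protein sequence. Passing the full gene sequence to the same functions would be the way to check the listed protein length and GC content.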