Gene CPR_0815 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_0815 
Symbol 
ID4205918 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp941279 
End bp943693 
Gene Length2415 bp 
Protein Length804 aa 
Translation table11 
GC content29% 
IMG OID642565374 
Productbeta-galactosidase 
Protein accessionYP_698140 
Protein GI110803978 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3250] Beta-galactosidase/beta-glucuronidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.57434 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGGAAAA TAATTCATAT CAATGATCAA TGGTTTTATG CAAATGATTA CAAAGGTGAG 
TATTTAAAAA ATGAGTTTGA TTTTAGTAAT TTTGAAAGGG TAGATTTACC TCATACAAAT
ATAGAGTTAC CATATAATTA TTTTGATGAG AAATCATATC AATTTGTTTC TACTTATGTT
AAAACTTTAA AATTTGATAA TAGTGTTAAA GGGAAAAAGG TATTTTTAGA TTTTGAAGGA
GTTATGATAG CCGCAGAAGT ATATTTAAAT GGAATTCATG TTGGTGGACA TAAGGGTGGT
TACACTAATT TTTCAATAGA TATTACTGAT GCTTTAAAAA TTAATGAAGA TAATATATTA
AAGGTTGTTG TTGACTCAAC AGAAAGACCG GATATACCTC CTCATGGATA TGTTGTTGAT
TATTTAACCT ATGGAGGAAT ATATAGAGAA GTTTCCTTAA GAGTAGTTGA ACCTATATTT
ATAAATAATT TATATGCAAG GGCATATGAT TGTTTAAAGG AAGAAAAAAG ATTAGAACTT
TATATAGAAA TAAATAATTT TGAAAAATAT AGAGATGATT TAGAAATTGT TGTAGACTTT
GGTGATGATA CTTTTGAAGA AACTTTAAGT ACAAAGTTAC CAATTGAAGA AGGAACAAGT
ATTAAAAATA TAGAAATAGA CCAATTAAAT ATGGTTAAGT TATGGGATAT AGAAAATCCT
AAGCTTTATG AAATAAAAGT TAAGTTATTA AAAGGTTCAG AAGTTATAGA TGAATATAAA
GATAATTTTG GATTTAGAGA GGCTGAATTT AGATCAGATG GTTTCTATTT AAATGGAAGA
AGAGTTAAAC TTGTTGGATT AAATCGTCAT CAGGCTTATC CATATGTAGG ATATGCTATG
CCTCAAAGAG TTCAAGAAAA GGATGCTGAG ATTTTAAAAT ATGAATTAGG ACTCAACATA
GTTAGAACAT CTCACTATCC GCAATCAGTA CACTTCTTAA GAAAATGTGA TGAGATTGGA
CTATTAGTTT TTGAAGAGAT ACCTGGTTGG CAACATATAG GTGATGAAGC TTGGCAAGCA
GAATCTATTA AAAATGTAGA GGAAATGATA AAAAAAGATT ACAATAGACC TTCCATAGTT
TTATGGGGCG TTAGAATAAA TGAGTCTCAA GATAGTCATG ATTTTTATGT GAAAACTAAT
GCTATGGCAA AGAGTTTAGA TCCTATTAGA CAAACTGGTG GGGTTAGATA CTTAGAAAAT
AGTGATTTCC TAGAAGATGT TTATACCATG AATGATTTTA TACACAGTGG GGGAGAAAAA
GTATTAAGAA CTCAAAGTGA AGTAACAGGA CAAGTAGATA AAGTTCCTTA TTTAGTAACT
GAGTATAATG GGCATATGTA TCCAACAAAA AGCTTTGATC AAGAATGTAA AAAAGTTGAA
CATGCTTATA GACATTTGAG AGTTATTAAT GAATCCTTTG GCTTAGATGA AATAAGTGGA
GCCATAGGAT GGTGTGCTTT TGATTATAAT ACACATAGTT CCTTTGGTTC AGGAGATAAA
ATTTGTTACC ATGGAGTTTC TGATATGTTC AGAAATCCTA AGTATGCAGC TTATTCCTAT
GCTAGCCAAA AGAAAGTAGA AGATGGTGTG GTTTTAGAAC CTATTACTTT AGGGGCTAAG
GGAGAAAGGG ATGGAGGAGC AATACTTCCA TTTACAGTTC TTACAAACTG TGATTATATA
AAAATATTTA AAGATGGAAT ATATATAGAT ACTTATTATC CTAATAAAGA AAAGTTCCCT
AATTTACCAC ATCCACCAAT AGAGGTTTCA CATATTTTAT CTATGGATTC AGAAATACCT
CTTACTGAAG AAGCAAAAAA AGAAATTAAA GACTTTGTAT TAAATAAATT AAAAGATTCT
AATTTAACTA ATTTAGCTGA AGAAGATTTT AAATATATTG AAGAATTTAG TGAAAGAGTA
AATATACCTG TATTTAAAAT AATGTCTTTA GTTTATAAAT TAGCTGGAGG TTGGGGAGAT
AAGGAAAACT CCTTAATAAT AAAAGGCTTT ATAGATAATA AAGAGGTTGC TTCTAAAGAA
ATAGGTGAAC TTAGAAGCAT GAATAAACTT GAAGTTACAC CAGATAATTT AGAACTTTCA
TTAGATAAAA CAAGTTATGA TGCTACTAGA ATCGTGGTTA AACTTTTAGA TAATTTAGGA
GAGGTTCTTT TCTTAAATAA TGATTTTATT GAAGTAGAAA TAGATGGACC TTTAAGTATA
ATGGGACCAA GTAAGTTTGG AATTTCTGGT GGAGCAGTCG CTTTCTGGGT AAGAACTCAA
GGAAAAACTG GGCTTTGCAA AATAAAGGTT AAGAGCATGT ACTTTGAAGA AGAAATTTCT
ATAGAAGTTA AGTAG
 
Protein sequence
MRKIIHINDQ WFYANDYKGE YLKNEFDFSN FERVDLPHTN IELPYNYFDE KSYQFVSTYV 
KTLKFDNSVK GKKVFLDFEG VMIAAEVYLN GIHVGGHKGG YTNFSIDITD ALKINEDNIL
KVVVDSTERP DIPPHGYVVD YLTYGGIYRE VSLRVVEPIF INNLYARAYD CLKEEKRLEL
YIEINNFEKY RDDLEIVVDF GDDTFEETLS TKLPIEEGTS IKNIEIDQLN MVKLWDIENP
KLYEIKVKLL KGSEVIDEYK DNFGFREAEF RSDGFYLNGR RVKLVGLNRH QAYPYVGYAM
PQRVQEKDAE ILKYELGLNI VRTSHYPQSV HFLRKCDEIG LLVFEEIPGW QHIGDEAWQA
ESIKNVEEMI KKDYNRPSIV LWGVRINESQ DSHDFYVKTN AMAKSLDPIR QTGGVRYLEN
SDFLEDVYTM NDFIHSGGEK VLRTQSEVTG QVDKVPYLVT EYNGHMYPTK SFDQECKKVE
HAYRHLRVIN ESFGLDEISG AIGWCAFDYN THSSFGSGDK ICYHGVSDMF RNPKYAAYSY
ASQKKVEDGV VLEPITLGAK GERDGGAILP FTVLTNCDYI KIFKDGIYID TYYPNKEKFP
NLPHPPIEVS HILSMDSEIP LTEEAKKEIK DFVLNKLKDS NLTNLAEEDF KYIEEFSERV
NIPVFKIMSL VYKLAGGWGD KENSLIIKGF IDNKEVASKE IGELRSMNKL EVTPDNLELS
LDKTSYDATR IVVKLLDNLG EVLFLNNDFI EVEIDGPLSI MGPSKFGISG GAVAFWVRTQ
GKTGLCKIKV KSMYFEEEIS IEVK