Gene CPR_0156 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_0156 
Symbol 
ID4205731 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp190582 
End bp192651 
Gene Length2070 bp 
Protein Length689 aa 
Translation table11 
GC content34% 
IMG OID642564711 
Productbeta-galactosidase 
Protein accessionYP_697493 
Protein GI110802773 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1874] Beta-galactosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAAAAATT ATGGACCAAT AAGTTCTAAA GTAACAAAAA TGCTTCATGG AGCAGATTAC 
AACCCAGAGC AATGGATTGA TATACCTAAC ATATGGGGAG AAGATGTTAG ATTAATGAAG
CTTTCTCATA CAAATGTTGT AGCTGTAGGC ATATTTTCAT GGACAATGTT AGAGCCTGAA
GAAGGTAAGT TTAACTTTGA ATGGTTAGAT GAAATCATGG ATTTAATGCA CAAGAATGGA
AACTATGTAA TACTTGCTAC ACCAAGTGGA GCTAAGCCAA TATGGATGGC TCATAAGTAT
CCAGAAACTT TAAGAGTTGC TCCAAACAGA GTAAGAAACC TTTATGGAGA GCGTCATAAT
CACTGTTATA CATCTCCTAT ATATAGAGAG AAAATTGCTA TAATAGACAG ATTATTAGCT
GAAAGATATA AAGATCACCC AGCTTTAATC TTATGGCATA TTTCAAATGA GTTTGAAGGT
CAATGTTATT GTCCTTTATG TGAACAAGCT TTTAGAGATT TCTTAAGAAA GAAATATGAT
AATGACATAA ATAAATTAAA TAAAGCTTGG TGGACTAAGT TCTGGAGCCA CACATATGCT
TCCTTTGATG AAATAGAAGC ACCAGCTCCT CATGGAGAGC CAGCTCTTCA TGGATTAAAC
TTAGATTGGA TGAGATTTGT TACTCACCAA ACTTTAGATT ACTACAAGCA TGAAAGAAGC
ATATTAAAGG AAATCACACC AGATATTCCT GTAACTACTA ACTTCCATGA CTATATTTCA
TTATTTAGAG GAATAGATTA CTGGAAATTT GCTCCTTACT TAGATGTTGT ATCATGGGAT
AACTATCCTT ACTGGCATGG AGAAAGAACA GATGACCACG AAGGAAGTAG AATTGGATTT
ATTCATGACT TAAATAGAGC AATTTTAAAT GGTAAGCCAT TTATGATGAT GGAAAGCTCA
CCAAGTTCAA CAAACTGGCA ACCAGTTGCA AAACTTAGAC GTCCAGGAAT GCATGTTTTA
TCTTCTCTTC AAGCTGTAGC TCATGGTTCA GATACAGTTC AATATTTCCA ATGGAGAAAG
AGTAGAGGAT CATCAGAAAA ATTCCATGGA GCTGTTGTTG ATCACTGTGG ACATGAAAAT
ACAAGAGTAT TTAGAGATGT AACTAAGGTT GGAGAAATTC TATCAAAACT TGATGATGTA
ATAGGAACTT CTGTAGAGCC ACAAGTAGCT GTTATTTACG ATTGGGAAAA TTACTGGGCA
ATAAATGATG CTCAAGGTCC TAGAATAGAG CAAAAGGATT ACTTTGAAAC TTGTCAAAAG
CACTACAAAG CTTTCTGGGA TATGTCAATT CCAACAGATG TTATAAACAT GGATTGTGAT
TTCTCAAAAT ACAAGGTTGT TGTAGCACCA ATGCTTTATA TGGTAAGACC AGGTGTTGGA
GAAAGATTAG AGGAATTCGT AAATAATGGA GGAACTTTAG TAACAACTTA CTGGAGTGGT
ATTGTTGATG AGAATGACCT TTGCTTCTTA GGAGGATTCC CTGGCCCATT AAAGAAAGTT
ACAGGAATCT GGGCTGAAGA ATTAGATGCA CTTTATGATG AAGATGTAAA CTATGTTTCT
ATTGAGGAGG GAAATAGTTT AGGAATGAAA GGTGAATATG AAGCTAGAAT ATTTTGTGAC
TTAATTCACT CAGAAGGAGC TGAAATTCTT GCAACATACA AAACAGACTT CTACAAAGGA
ATGCCAGCTT TAACTTGTAA CAACTTTGGA GAGGGACAAG CTTATTATAT AGCATTTAGA
AATAATGATG AGTTCTTATC AGATTTCTAC TCAAGCTTAG CTAAGAAGTT AACTTTAAAG
AGAGCTATAG AAATTGACTT ACCAAATGGA ATTAATGCTC AAGTTCGTAT GGATGAAAAG
AATGAGTTTG TATTCTTTAT GAACTTCTCA TCAGAGGAAA AAACAATAGA CATTAAGGAT
CTTGATTTAA CAGATATGGT AACTGGTGAA AAGGTAACTA AGGAAATGGA AATAGAACCT
TATGGAGTTA GAATAGTTAG AAGAAAATAA
 
Protein sequence
MKNYGPISSK VTKMLHGADY NPEQWIDIPN IWGEDVRLMK LSHTNVVAVG IFSWTMLEPE 
EGKFNFEWLD EIMDLMHKNG NYVILATPSG AKPIWMAHKY PETLRVAPNR VRNLYGERHN
HCYTSPIYRE KIAIIDRLLA ERYKDHPALI LWHISNEFEG QCYCPLCEQA FRDFLRKKYD
NDINKLNKAW WTKFWSHTYA SFDEIEAPAP HGEPALHGLN LDWMRFVTHQ TLDYYKHERS
ILKEITPDIP VTTNFHDYIS LFRGIDYWKF APYLDVVSWD NYPYWHGERT DDHEGSRIGF
IHDLNRAILN GKPFMMMESS PSSTNWQPVA KLRRPGMHVL SSLQAVAHGS DTVQYFQWRK
SRGSSEKFHG AVVDHCGHEN TRVFRDVTKV GEILSKLDDV IGTSVEPQVA VIYDWENYWA
INDAQGPRIE QKDYFETCQK HYKAFWDMSI PTDVINMDCD FSKYKVVVAP MLYMVRPGVG
ERLEEFVNNG GTLVTTYWSG IVDENDLCFL GGFPGPLKKV TGIWAEELDA LYDEDVNYVS
IEEGNSLGMK GEYEARIFCD LIHSEGAEIL ATYKTDFYKG MPALTCNNFG EGQAYYIAFR
NNDEFLSDFY SSLAKKLTLK RAIEIDLPNG INAQVRMDEK NEFVFFMNFS SEEKTIDIKD
LDLTDMVTGE KVTKEMEIEP YGVRIVRRK