Gene CPF_0160 details


Gene Information

Locus tag: CPF_0160
Symbol:
ID: 4202873
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens ATCC 13124
Kingdom: Bacteria
Replicon accession: NC_008261
Strand:
Start bp: 189000
End bp: 191069
Gene Length: 2070 bp
Protein Length: 689 aa
Translation table: 11
GC content: 35%
IMG OID: 638081041
Product: beta-galactosidase
Protein accession: YP_694624
Protein GI: 110800290
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1874] Beta-galactosidase
TIGRFAM ID:
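
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that Protein Length excludes the stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acids encoded by a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(189000, 191069)
print(length_bp)                  # 2070
print(protein_length(length_bp))  # 689
```

Both results match the Gene Length and Protein Length fields of the record.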


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGAAAAATT ATGGACCAAT AAGTTCTAAA GTAACAAAAA TGCTTCATGG AGCAGATTAC 
AACCCAGAGC AATGGATTGA TATGCCTAAC ATATGGGGAG AAGATGTTAG ATTAATGAAG
CTTTCTCATA CAAATGTTGT AGCTGTAGGC ATATTCTCAT GGACAATGTT AGAGCCAGAG
GAAGGTAAGT TTAACTTTGA ATGGTTAGAT GAAATCATGG ATTTAATGCA CAAGAATGGA
AACTATGTAA TACTTGCTAC ACCAAGTGGA GCTAAGCCAA TATGGATGGC TCATAAGTAT
CCAGAAACTT TAAGAGTTGC TCAAAACAGA GTAAGAAACT TATATGGAGA GCGTCATAAC
CATTGCTATA CATCTCCTAT ATATAGAGAG AAAATTGCTA TAATAGACAG ATTATTAGCT
GAAAGATATA AAGATCACCC AGCTTTAATT TTATGGCATA TTTCAAATGA GTTTGAAGGT
CAATGTTACT GTCCTTTATG TGAAGAAGCT TTTAGAGATT TCTTAAGAGA GAAATATGAT
AATGATATAA ATAAATTAAA TAAAGCTTGG TGGACTAAGT TCTGGAGCCA TACATATGCT
TCCTTTGATG AAATAGAAGC GCCAGCTCCT CATGGAGAAC CAGCTCTTCA TGGATTAAAC
TTAGACTGGA TGAGATTTGT TACTCACCAA ACTTTAGATT ATTACAAGCA TGAAAGAAGC
ATATTAAAGG AAATCACACC AGATATTCCT GTAACTACTA ACTTCCATGA CTATATTTCA
TTATTTAGAG GAATAGATTA CTGGAAGTTT GCTCCTTACT TAGATGTTGT ATCATGGGAT
AACTATCCTT ACTGGCATGG TGAAAGAACA GATGACCACG AAGGAAGTAG AATTGGATTT
GTTCATGACT TAAATAGAGC AATTTTAAAT GGTAAGCCAT TTATGATGAT GGAAAGTTCT
CCAAGTTCAA CAAACTGGCA ACCAGTTGCA AAACTTAGAC GTCCAGGAAT GCATGTTTTA
TCTTCTCTTC AAGCTGTAGC ACATGGTTCA GATACAGTTC AATACTTCCA ATGGAGAAAG
AGTAGAGGAT CATCAGAAAA GTTCCATGGA GCTGTTGTTG ATCACTGTGG ACATGAAAAT
ACAAGAGTAT TTAGAGATGT AACTAAGGTT GGAGAAATTC TATCAAAACT TGATGATGTA
ATAGGAACTT CTGTAGAACC ACAAGTAGCT GTTATTTACG ATTGGGAAAA CTACTGGGCA
ATAAATGATG CTCAAGGTCC TAGAATAGAG CAAAAGGATT ACTTTGAAAC TTGTCAAAAG
CACTACAAAG CTTTCTGGGA TATGTCAATT CCAACAGATG TTATAAACAT GGATTGCGAT
TTCTCAAAAT ATAAGGTTGT TGTAGCACCA ATGCTTTATA TGGTAAGACC AGGTGTTGGA
GAAAGATTAG AGGAATTCGT TAAAAATGGA GGAACTTTAG TAACAACTTA CTGGAGTGGT
ATTGTTGATG AAAATGACCT TTGCTTCTTA GGAGGATTCC CTGGCCCATT AAAGAAAGTT
ACAGGAATCT GGGCAGAAGA ATTAGATGCA CTTTATGATG AAGATGTAAA CTATGTTTCT
GTTGAAGAAG GCAATAGCTT AGGAATGAAA GGTGAATATG AGGCTAGAAT ATTCTGTGAC
TTAATTCACT CAGAAGGAGC TGAAGTTCTT GCAACATACA AAACAGACTT CTACAAAGGA
ATGCCAGCTT TAACTTGCAA CAACTTTGGA GAAGGACAAG CTTACTATAT AGCATTTAGA
AATAATGATG AGTTCTTATC AGATTTCTAC TCAAGCTTAG CTAAGAAGTT AACTTTAAAG
AGAGCTATAG AAATTGACTT ACCAAAGGGA ATTAATGCTC AAGTTCGTAT GGATGAAAAG
AACGAGTTTG TATTCTTTAT GAACTTCTCA TCAGAGGAAA AAACAATAGA CATTAAAGAT
CTTGATTTAA CAGATATGGT AACTGGTGAA AAGGTAGCTA AGGAAATGGA AATAGAGCCT
TATGGAGTTA GAATAGTTAG AAGAAAATAA
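
A GC content figure like the one in the Gene Information section can be computed directly from a sequence in this layout. The following is a minimal sketch, assuming GC content means the percentage of G and C bases over all bases; how the database rounds its reported value is not known, and `gc_content` is a name chosen here for illustration.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases; whitespace between 10-base groups is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First line of the gene sequence above (60 bases, 18 of them G or C).
first_line = "TTGAAAAATT ATGGACCAAT AAGTTCTAAA GTAACAAAAA TGCTTCATGG AGCAGATTAC"
print(f"{gc_content(first_line):.1f}%")  # 30.0%
```

Passing the full gene sequence instead of a single line gives the value for the whole gene.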
 
Protein sequence
MKNYGPISSK VTKMLHGADY NPEQWIDMPN IWGEDVRLMK LSHTNVVAVG IFSWTMLEPE 
EGKFNFEWLD EIMDLMHKNG NYVILATPSG AKPIWMAHKY PETLRVAQNR VRNLYGERHN
HCYTSPIYRE KIAIIDRLLA ERYKDHPALI LWHISNEFEG QCYCPLCEEA FRDFLREKYD
NDINKLNKAW WTKFWSHTYA SFDEIEAPAP HGEPALHGLN LDWMRFVTHQ TLDYYKHERS
ILKEITPDIP VTTNFHDYIS LFRGIDYWKF APYLDVVSWD NYPYWHGERT DDHEGSRIGF
VHDLNRAILN GKPFMMMESS PSSTNWQPVA KLRRPGMHVL SSLQAVAHGS DTVQYFQWRK
SRGSSEKFHG AVVDHCGHEN TRVFRDVTKV GEILSKLDDV IGTSVEPQVA VIYDWENYWA
INDAQGPRIE QKDYFETCQK HYKAFWDMSI PTDVINMDCD FSKYKVVVAP MLYMVRPGVG
ERLEEFVKNG GTLVTTYWSG IVDENDLCFL GGFPGPLKKV TGIWAEELDA LYDEDVNYVS
VEEGNSLGMK GEYEARIFCD LIHSEGAEVL ATYKTDFYKG MPALTCNNFG EGQAYYIAFR
NNDEFLSDFY SSLAKKLTLK RAIEIDLPKG INAQVRMDEK NEFVFFMNFS SEEKTIDIKD
LDLTDMVTGE KVAKEMEIEP YGVRIVRRK
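
The gene sequence begins with TTG rather than ATG, while the protein sequence begins with M. This is consistent with translation table 11 (the bacterial, archaeal and plant plastid code), which accepts TTG as an initiation codon; an initiation codon is read as methionine. The sketch below illustrates this with a hand-built codon table. The helper names are hypothetical, and this is not necessarily how the database derived its protein sequence.

```python
from itertools import product

# Amino acids for table 11, with codons ordered T, C, A, G at each position
# (third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Initiation codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop spaces between 10-base groups
    usable = len(dna) - len(dna) % 3
    protein = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # an initiation codon is read as methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


first_line = "TTGAAAAATT ATGGACCAAT AAGTTCTAAA GTAACAAAAA TGCTTCATGG AGCAGATTAC"
print(translate_cds(first_line))  # MKNYGPISSKVTKMLHGADY
```

The output matches the first 20 residues of the protein sequence above.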