Gene CPF_0766 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_0766 
Symbol 
ID4202276 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp908589 
End bp911627 
Gene Length3039 bp 
Protein Length1012 aa 
Translation table11 
GC content28% 
IMG OID638081650 
Productbeta-galactosidase 
Protein accessionYP_695217 
Protein GI110799026 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3250] Beta-galactosidase/beta-glucuronidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.156765 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATATTAA AAAATGAATA CCATGAAGAT ATTTCTAAAC TGCATGTAAA TATGATGCCA 
AGAAGAAGTT ATTATGTTCC TTTTGTGGAT ACTGATGAAG CTTTAAATAT TAAAGATAGA
AGTAAGCAAC TTAACTTCTT TTCACTTAAT GGTAAATGGG AATTTAATTA CTTTGATAGC
TTACAAAAAG TTAAAGAGTT TGATGATATA AATCAAATTA GTTTTTCAGA TAATATTGAT
GTTCCATCTC TTTGGCAATT AAAGGGATAT GATTATAATC AATACACTAA TGTAAAATAT
CCAATTCCTT TTGATCCACC TTTTGTTCCT ATAAACAATC CATGTGGAAT TTATAAGAGA
GATTTTGAAA TTGAAATACT TCCTGAAAAT TATGATTATA ATATAAACTT TGAAGGCGTT
GATTCCTGCT TTTATTTTTG GATAAATGAT AACTTTGTTG GGTATAGCCA AATATCTCAT
AGCATATCTG AGTTTGATAT TACAGAATTT TTAGTAAAGG GTAAAAATAC AATTACAGTC
TTAGTTTTAA AATGGTGTGA TGGAACTTAT TTTGAAGATC AAGACAAGTT TAGAATGTCT
GGAATTTTTA GAGATGTATA TATATTAAGA AGAGCAAAGG AAAGAATAGT TGACTATAAA
ATAACTCAAA GTATAGATTT TTCAGCTAAA GAAGGAAAGT TAGACCTTGA AATTTTATCT
AATATAGGAA ATCCAAAAGG GAAATATTAT TTGTTAAATC CTAATAATCA CATGATAGCA
TCTGGAAATA TAGATAATAA TAAAATCCAA ATCAATATTA AAAATGTAGA ACTTTGGAGT
GCTGAAATAC CTAATTTATA TACCTTATTA ATTGAAACAG AGCATGAGGT TATAAAAGAA
AGAATAGGTA TGAGAGAAAT TAAAATAGAG AATTCTATTT TAAAAATTAA TAATAAAAAA
ATAAAGTTAA GAGGAGTTAA TCATCATGAT AGTAATCCAA CTAAGGGATA TGTTATGACA
TATGATGATA TGATTCTAGA TTTAAAAATA ATGAAAGAGT GTAATGTTAA CTCTATAAGA
ACAGCTCATT ATCCTAAATC TCCGATTTTT TATGAGTTAT GTGATGAATA TGGTTTTTAT
GTTATGAGTG AGGCAGATAT TGAAATTCAT GGGGTTGTTG AACTATATGG ATTAGGATAT
TTAGATAATT ATAATATGAT AGCTGATGAT AAAGTTTATG AAAAGGTTAT TATTGATAGA
GTTGATTCAT CTATAGTTCC TTTTAAGAAT AAATCTTGTA TTTTCATGTG GTCTCTTGGA
AATGAGTCAG GATTTGGATG TAACTTTGAA AGAGGATTAG AATATGCTAG GGCATTAGAT
CCTACACGTC CACTTCATTA TGAAGGTGCT TATTATGCTA GCAAAGAAAG AGAAAATGAC
TTTACAAATA TTGATGTTAT TAGTCGTATG TATATAAGTA TAGAGGAAAT CAAGGATTAC
TTTGAAAAAG GAATAGACAA GCCATTAATA TTATGTGAGT ATGCACATGC TATGGGAAAT
GGACCAGGTG GATTACAAGA CTATGATGAA ATGATACAAA AGTATGACCA GTTTGCAGGG
GCATATGTTT GGGAGTGGTG TGACCATGCA ATACTTATTA ATGAAAATAT TAATGATAAA
AAAGCATATG GATATGGCGG CGATTTTGAA GAGGAAAATC ATGATGGAAA CTTCTGTGTA
GATGGTTTAG TTTACCCAGA TAGAACTCCT CATACTGGAT TATTAGAATA TAAAAATATC
AATAGACCTA TAAGAGCCAT AGAATTTGAT GAAGTTAAGA AAAGAGTTAA GCTTAAAAAT
ATGTTTGATT TTAGGAATGC TGGTGAATTC TTAGATGTTA CCTATAAAGT ATTTTTAGAT
GGAGAAATAA TTTTCGGAGA TGATATAGAT TTAGAAAGCT TAAAAGCAAA AGAAGAAAAA
TGGTATGATT TAAGCATATC AGAATTACCA AAGGAAATTA TAACTATATT ATTTCAGTAT
AAAGTGAAAA ATCATAATCA TCTGTATGAA AAAGGTGAAG TTCTAGGCTT TGACAATTTT
ATCATAAAAA ATGGGGTAGA TAATATCTCA TCTGTTGATA AAATCATAAA AGGTAATATT
AATGAACAAA AATTTTATGT AGAAGAAACA GTTAATAAGA TTAAAGTTAT AAATAATGAG
TTTATTTATA ATTATAATAA AAATACAGGT TCCTTTGATT TTATTCAAGC ATTAGGAGAA
ACATTTATAG ATGATCCAAT GAAATTTATT ATTTGGAGAG CACCTACAGA TAATGATAGA
AAGATTAAAA ATCTGTGGAT TGAAGCTGGC TTTAATCAAA TTACTACAAG AGTGTATAAT
AGTAAAATTA AGGAGTTTTC AAATAGGGTG GAGATAACTA GCGATTTAAG CTTGATACCA
CCATATAGAG AAAGAGTCCT TGATCTTAAG GTAACATGGA GTATCTATTC AGAGGGGTTA
ATTAAATGCC ATGTTAAAGG GGATAAAAAT ATGAAAACAC CTTATTTACC AAGGTTTGGT
GTAGAGCTTA AGCTGAATAA ATCCTATGAA GAGGTAAGTT ACTTTGGATT TGGACCATAT
GAAAATTACG TAGATAAAAA TTCATCTTGT TATTTAGGAA GATTTAATTC TAAGGTTTCT
GAAATGCATG AAGATTATAT AAGACCTCAA GAAAATGGAA GTCATCATTA TTGTAGAGAA
GTAGCTATTA ATAATGAAAA AGGAAAGGTT TATGTTTTAT CAGAAAATGA CTTTGCCTTT
AATGTTTCAC ACTTTTCTTT AAATCAATTA ACTAATGCAA ATCATAATTT TGATTTGAAT
GAAGAAGAGG CAACTTATTT AATTGTAGAT TATAAACAAA GTGGTATAGG ATCAAATAGT
TGTGGCCCTG ATTTAGATGA AGAATATAGA CTAAATGAAA AAGAATTTTC TTATGATTTC
TACTTAAAAT TTGTAAAAGA TAATATAAAG GATAATTAA
 
Protein sequence
MILKNEYHED ISKLHVNMMP RRSYYVPFVD TDEALNIKDR SKQLNFFSLN GKWEFNYFDS 
LQKVKEFDDI NQISFSDNID VPSLWQLKGY DYNQYTNVKY PIPFDPPFVP INNPCGIYKR
DFEIEILPEN YDYNINFEGV DSCFYFWIND NFVGYSQISH SISEFDITEF LVKGKNTITV
LVLKWCDGTY FEDQDKFRMS GIFRDVYILR RAKERIVDYK ITQSIDFSAK EGKLDLEILS
NIGNPKGKYY LLNPNNHMIA SGNIDNNKIQ INIKNVELWS AEIPNLYTLL IETEHEVIKE
RIGMREIKIE NSILKINNKK IKLRGVNHHD SNPTKGYVMT YDDMILDLKI MKECNVNSIR
TAHYPKSPIF YELCDEYGFY VMSEADIEIH GVVELYGLGY LDNYNMIADD KVYEKVIIDR
VDSSIVPFKN KSCIFMWSLG NESGFGCNFE RGLEYARALD PTRPLHYEGA YYASKEREND
FTNIDVISRM YISIEEIKDY FEKGIDKPLI LCEYAHAMGN GPGGLQDYDE MIQKYDQFAG
AYVWEWCDHA ILINENINDK KAYGYGGDFE EENHDGNFCV DGLVYPDRTP HTGLLEYKNI
NRPIRAIEFD EVKKRVKLKN MFDFRNAGEF LDVTYKVFLD GEIIFGDDID LESLKAKEEK
WYDLSISELP KEIITILFQY KVKNHNHLYE KGEVLGFDNF IIKNGVDNIS SVDKIIKGNI
NEQKFYVEET VNKIKVINNE FIYNYNKNTG SFDFIQALGE TFIDDPMKFI IWRAPTDNDR
KIKNLWIEAG FNQITTRVYN SKIKEFSNRV EITSDLSLIP PYRERVLDLK VTWSIYSEGL
IKCHVKGDKN MKTPYLPRFG VELKLNKSYE EVSYFGFGPY ENYVDKNSSC YLGRFNSKVS
EMHEDYIRPQ ENGSHHYCRE VAINNEKGKV YVLSENDFAF NVSHFSLNQL TNANHNFDLN
EEEATYLIVD YKQSGIGSNS CGPDLDEEYR LNEKEFSYDF YLKFVKDNIK DN