Gene CPF_0491 details


Gene Information

Locus tag: CPF_0491
Symbol:
ID: 4201373
Type: CDS
Is gene spliced: No
Is pseudogene: No
Organism name: Clostridium perfringens ATCC 13124
Kingdom: Bacteria
Replicon accession: NC_008261
Strand:
Start bp: 585866
End bp: 588070
Gene length: 2205 bp
Protein length: 734 aa
Translation table: 11
GC content: 32%
IMG OID: 638081373
Product: alpha-galactosidase
Protein accession: YP_694945
Protein GI: 110798960
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3345] Alpha-galactosidase
TIGRFAM ID:
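
The start, end, gene length, and protein length fields above agree with one another, which a few lines of arithmetic can confirm. This is a minimal sketch: it assumes the coordinates are 1-based and inclusive and that the gene length counts one terminal stop codon. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given inclusive, 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(585866, 588070)
print(length_bp)                  # 2205
print(protein_length(length_bp))  # 734
```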


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGAATTA TGAGTATAAA TTATAATGAG AATTTTAAAA CTTTTCATCT AAGGACAAAG 
AATACAAGTT ATGTTTTGAA GGTTATGGAG ACAGGACATT TAAGTCATTT GTATTGGGGA
AGAAAGTTAA AAGCTGATAA CTTAGAGTAT TTTGTTAGAA GAAGAGGTTT TGGAAGCTTT
GCAGCTGATA CTGATAATAT AAGTGGATTT CAGTTAGAGT TAATACCTCA AGAGTGTCCA
ACCTTTGGAG CTACTGATTT AAGAAGTCCA AGCTTAGAGT TTCAATATGA AGATGGTACA
TCAGCTACTG ATTTAAGATA TAAGTCACAT AGAATTTACG AAGGAAAGCA AAGACTTTCA
GGTTTACCAG CTGTGTATGT TGAAAGTGAG GAAGAGGCTA CTTCCCTAGA AATAACTTTA
GTTGATTCTT TAAAAAACTT AGAGGTTATC TTAACATATA ATGTTTTTGA AAACTTTGAT
TCTATTACAA GAAGTTTAAA GATAGTGAAT AATAGTGATG AAAAGATAAA TATAGAGAGA
GTTTTAAGTG CCAATGTAGA CTTTACAACT GATGAATTTG ATTTTATTCA GCTTTCAGGA
TCTTGGGGAA GAGAGAGACA TATTCTTAGA AATCCTTTAA GAAGTGGAAG CCAAGCTATT
GAAAGTAGAA GAGGGGCAAG TAGCCATGCT CAAAATCCTT TCATGGCTCT ATGTAGTAAG
GATGCCAATG AAGAATATGG AGATGTTTAT GGCTTTAACT TAGTTTATAG TGGAAACTTC
TTAGCAAATG TGGAAGTTGA TATGTATAGA AATGCAAGAG CTCAAATAGG AATAAATCCT
TTCGATTTTA AATGGTTACT TGAGTCAAAA GAAGAGTTTC AAGCACCAGA GGTAGTTTTA
GTTTATTCCT CAAAGGGACT AAATGGCATG TCTCAAATTT ATCATAATCT TTATAGAAAG
AGATTGTGTA GAGGAAATTA TAGAGATAAG GTAAGACCTG TGCTTATAAA TAACTGGGAA
GCCACATATT TTGACTTTAA TGAGGTTAAG ATAAAGGAAA TAGCTAAGGA AGCTTCAAAG
CTAGGAATGG AACTTTTTGT TCTTGATGAT GGATGGTTTG GAAATAGAAA TGATGATAAA
AGTTCCTTAG GAGATTGGTT TGTTAATGAG GAAAAATTAA AGGGTGGACT TAGTAAACTT
GCTAAAGATA TAAATAATAT GGGGTTAGAG TTTGGATTAT GGTTTGAGCC TGAGATGATT
TCACCTATTA GTAAACTTTA TGAAAAACAT CCAAATTGGT GTATTCATAT TCCAGGAAGA
ACTAGATCAC AGGCAAGAAG TCAGTTAATA TTAGACCTAT CAATGAAAGA AGTATGTGAT
TATATAATAG AATCTGTTAG CAAAATTCTT GAAAGTTCTA ATATATCTTA TGTTAAGTGG
GATATGAATA GAAATATGAC AGAGGTGGGT TCTTTAGGAT TAACTTCAGA GAGACAAAGA
GAAACAGCTC ATAGATATAT TTTGGGATTA TATAGGGTTA TGGAGGAAAT AACAAGTAGA
TTCCCTAATG TATTATTTGA AAGCTGCTCA GGTGGTGGTG GAAGATTTGA CCCAGGAATA
CTTTATTATA TGCCTCAAAC TTGGACAAGC GATGATACAG ATGCCATAGA AAGATTAAAA
ATACAGTTTG GAACCTCTAT GATTTATCCT CCAATTTCCA TGGGATGCCA TGTTTCAGCA
ATTCCTAATC ATCAAGCTAA TAGAACAACT CCACTTGAAA CTAGAGGGGT ATCTGCTATG
GCAGGTAATT TTGGATATGA GCTTGATATA ACTAAGTTAA GTGAGGAAGA AAAGGAAGAA
TTAAAGGAAC AAATAAGTTT ATATAAAGAA ATTAGAGAAA CTGTGCAATT TGGAGCTTTG
TATAGATTAA AGAGTCCATT TAATAGTAAT GAAGTTGCAT GGATGATGAT TTCAGAAGAT
AAGAATGAGG TTGTTGTAAG CTATGTTAGA CAGTGGGCTT TAGTAAATGA AAGCTTTAGC
AATTTAAAAC TTACAGCCTT AGATAAGGAT TCAGAGTATG AAATAATAGG AGAAGACACA
ATCCTTAGTG GAGATGAGCT TATGTATATA GGTTTAAATA TTCCAGAACT TTATGGAGAT
TATGTTTCAA AACTTTGGAG ATTAAAGAGA AAAACTTTAA AATAA
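
The gene sequence is printed in blocks of ten bases separated by spaces, so the whitespace has to be removed before the sequence can be measured. The sketch below shows one way to do that and to compute a GC fraction. The helper names are hypothetical, and only the first line of the sequence is used as sample input, so the printed value describes that line alone and not the gene-wide GC content reported in the record.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence only (sample input, not the whole gene).
first_line = "ATGAGAATTA TGAGTATAAA TTATAATGAG AATTTTAAAA CTTTTCATCT AAGGACAAAG "
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"GC fraction of this line: {gc_fraction(seq):.3f}")
```

To reproduce the gene-wide figure, pass all lines of the gene sequence to `clean_sequence` in one string.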
 
Protein sequence
MRIMSINYNE NFKTFHLRTK NTSYVLKVME TGHLSHLYWG RKLKADNLEY FVRRRGFGSF 
AADTDNISGF QLELIPQECP TFGATDLRSP SLEFQYEDGT SATDLRYKSH RIYEGKQRLS
GLPAVYVESE EEATSLEITL VDSLKNLEVI LTYNVFENFD SITRSLKIVN NSDEKINIER
VLSANVDFTT DEFDFIQLSG SWGRERHILR NPLRSGSQAI ESRRGASSHA QNPFMALCSK
DANEEYGDVY GFNLVYSGNF LANVEVDMYR NARAQIGINP FDFKWLLESK EEFQAPEVVL
VYSSKGLNGM SQIYHNLYRK RLCRGNYRDK VRPVLINNWE ATYFDFNEVK IKEIAKEASK
LGMELFVLDD GWFGNRNDDK SSLGDWFVNE EKLKGGLSKL AKDINNMGLE FGLWFEPEMI
SPISKLYEKH PNWCIHIPGR TRSQARSQLI LDLSMKEVCD YIIESVSKIL ESSNISYVKW
DMNRNMTEVG SLGLTSERQR ETAHRYILGL YRVMEEITSR FPNVLFESCS GGGGRFDPGI
LYYMPQTWTS DDTDAIERLK IQFGTSMIYP PISMGCHVSA IPNHQANRTT PLETRGVSAM
AGNFGYELDI TKLSEEEKEE LKEQISLYKE IRETVQFGAL YRLKSPFNSN EVAWMMISED
KNEVVVSYVR QWALVNESFS NLKLTALDKD SEYEIIGEDT ILSGDELMYI GLNIPELYGD
YVSKLWRLKR KTLK
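
The protein sequence can be checked against the gene sequence by translating it. The sketch below is a hedged illustration and not the pipeline that produced this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code and differs only in which codons may act as start codons; because this gene begins with ATG, a plain standard-code lookup is enough here. `translate` is an illustrative helper that stops at the first stop codon.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard code; '*' marks stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1 up to the first stop codon.

    Whitespace is ignored; any codon with a non-ACGT symbol raises KeyError.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGAGAATTA TGAGTATAAA TTATAATGAG"))  # MRIMSINYNE
# Last 15 bases give the final four residues followed by the stop codon.
print(translate("AAAACTTTAA AATAA"))                  # KTLK
```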