Gene CPF_2333 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_2333 
Symbol 
ID4203265 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp2593484 
End bp2595724 
Gene Length2241 bp 
Protein Length746 aa 
Translation table11 
GC content30% 
IMG OID638083198 
Productalpha-glucosidase 
Protein accessionYP_696756 
Protein GI110799390 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1501] Alpha-glucosidases, family 31 of glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGCAAT TAAATTACAG AGAGAATTTA AATATGAAGT TTAAAGCTTA TAATGGGGGA 
TTGAGAGTTT TTAAAAATTA TGAGATTAAT CATAATAATA TAGATATTTA TTTTTCAAAT
ATGAAAATAA CTCTTACTAT ATTTGAAAAT GATATAGTTA AAGTTTTTAT TGGAGACAAA
TACGAGGAAA GCATTTCAAC TAATGGTGTA GTAGATAATT TAGGAAAAGG TGAATTTATA
GTAGAAGAGG ATTCAAACTT TGTAATTGTA AAGGGCACTA AAGTTTTAAC CTTTGTAGAT
AAAAATACTA CAGAGATAAG TTTTAGAGAC TTAGAAGGAA ATATAATAAA TGAAGATTTT
CAGCCAAGCT TTAAGGATGA AGAAGGAAAT GTGTACATAT CAAAGGTAAA TGATTGTTTA
GCATATTATG GACTTGGAGA AAAGGGTGGA GATTTAAATA AAAAAGGATG TTATACAGAA
AACTTTAATA CTGATGATCC AGAGACTGAT GATGATTCCA TAACTTATTA TAAGACAATT
CCTTTTTATG TTGCCTTAAA AGAAGAAGCT ACCTATGGAA TATTCTTTGA TAATAGTTTT
AGAAGTTATT TTGACATGGG AAAGGAAATG GGAGATAGGA TTTTCTTTGG AGCCATAGGA
GGACAAATTC AATATTACTT TATTCCAGGA GAAAATATTA AAGAGGTAGT TAAGAATTAT
ACTGCTTTAA CAGGGAGAAT GGAGATACCA CCTCTTTGGA GCTTAGGGTA TCAACAATGT
AGATTTAGTT ATTTTAGCCA AGAGGAAGTA AGAGAATTAG TAAAAACCTT TGAAGAAAAA
GATATTCCCT TAGATGTAGT TTATTTAGAT ATAGATTACA TGGATGGATT TAGAGTTATG
ACTTTTAAAA CTCCTAATTT TGATGATGCA GCTGGTCTTA TTAGTGATTT AAAAGAAAAG
GGAATAAGAA CTATTACTAT TATTGACCCT GGTGTTAATG TAGATGAAGA GTATGATGTA
TTTAAAAGAG GTAAAGAAGG AAATCATTTT ACTAAAAAGT TAGATGGAGA AATGTTTATT
GGGGCAGTTT GGCCAGGTGA TAGTGCTTTC CCTGACTTTT CAAATAAGGA TTGTAGGGAA
TGGTGGAAAA GTGAACTTAA AAAATTCATA AGTGAACATG GCATGGATGG AATTTGGAAT
GATATGAATG AACCTTGTGT CTTTAATAAT GATCATAAAA CAATGTTAGA AACCTGTCTT
CATAATAGTG ATAATGGAGT TATAGAACAT AAGGAGTTTC ATAATAGATA TGGCTTTGAA
ATGAGCAGAT GTTCTAAGGA AGCGCAAGAA GAATTACATC CTAATGAAAG AGGATTTTCA
ATGACTAGAG CTACCTATGC TGGTGGACAA AGATATTCCT CAGTTTGGAC TGGAGATAAT
ATGAGCCTTT GGAGCCAAAT GAGAATGTCA ATATCAATGA ATGCTAATTT AGGAATCAGT
GGATTTTCCT TTGTTGGAAA TGATGTTTCA GGTTTTGGAT TAGATTCAAG TGAAGAATTA
TTTATAAGAT GGATGGAAAT GGGGCCATTT ATTCCTATAT TCAGAAATCA CTCCAATATG
TACACTAGAA GACAAGAACC ATGGGCTTTT GGACCAAGAG CTGAAAAAAT AGCAAAGAAA
TCTATTGAGT TAAGATATGA GTTACTTCCA TATATTTATG ATTTATATTA TATATCACAT
AAAGAAGGAC TTCCTATATT TAGACCTATG ATAATGGAAT ACGAGAAAGA TATGAATCTT
TTAAATATGA GAGAACAATT TATGTTAGGT GAAAATATGC TTGTTGCACC AGTATTATAT
GAAGGGGAAA GAAGCAAAAC TGTATATTTA CCAAAGGGAA GTTGGTTTAA TTATTTTACA
ATGGAGAAAT TACAAGGAGG AAAGTGGTAT AAGCTTCCTT GTGAATTAGA TGAAATTTTA
GTTTTTGTTA AAGAAGGCGC AATAATACCA ACATATAATA AGAAATTTAG AAATGTTAAA
GAAAGACCAA AGAATATACT TCTTAAGGTT TTTGGAGAGA ATGCTAAGGG ATTCCACTAT
AATGATGATG GACATACTAT GGAATATTTA GAGGGAAAAT ATACTTATAT GGACATAAAA
GTTGTAGATG GAAAAGAAGA ACTTAAGCTT ATTAATAATG GATATAGTAT AGAAGATATA
GAAATTCAAA TAATAAAATA A
 
Protein sequence
MTQLNYRENL NMKFKAYNGG LRVFKNYEIN HNNIDIYFSN MKITLTIFEN DIVKVFIGDK 
YEESISTNGV VDNLGKGEFI VEEDSNFVIV KGTKVLTFVD KNTTEISFRD LEGNIINEDF
QPSFKDEEGN VYISKVNDCL AYYGLGEKGG DLNKKGCYTE NFNTDDPETD DDSITYYKTI
PFYVALKEEA TYGIFFDNSF RSYFDMGKEM GDRIFFGAIG GQIQYYFIPG ENIKEVVKNY
TALTGRMEIP PLWSLGYQQC RFSYFSQEEV RELVKTFEEK DIPLDVVYLD IDYMDGFRVM
TFKTPNFDDA AGLISDLKEK GIRTITIIDP GVNVDEEYDV FKRGKEGNHF TKKLDGEMFI
GAVWPGDSAF PDFSNKDCRE WWKSELKKFI SEHGMDGIWN DMNEPCVFNN DHKTMLETCL
HNSDNGVIEH KEFHNRYGFE MSRCSKEAQE ELHPNERGFS MTRATYAGGQ RYSSVWTGDN
MSLWSQMRMS ISMNANLGIS GFSFVGNDVS GFGLDSSEEL FIRWMEMGPF IPIFRNHSNM
YTRRQEPWAF GPRAEKIAKK SIELRYELLP YIYDLYYISH KEGLPIFRPM IMEYEKDMNL
LNMREQFMLG ENMLVAPVLY EGERSKTVYL PKGSWFNYFT MEKLQGGKWY KLPCELDEIL
VFVKEGAIIP TYNKKFRNVK ERPKNILLKV FGENAKGFHY NDDGHTMEYL EGKYTYMDIK
VVDGKEELKL INNGYSIEDI EIQIIK