Gene Ccel_2455 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_2455 
Symbol 
ID7311124 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp2966986 
End bp2969298 
Gene Length2313 bp 
Protein Length770 aa 
Translation table11 
GC content45% 
IMG OID643609385 
Productalpha-xylosidase YicI 
Protein accessionYP_002506764 
Protein GI220929855 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1501] Alpha-glucosidases, family 31 of glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.623882 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAATTTC TAAATGGATA CTGGATGAGT AAGGAAGGTT ACTCTCTGTA TTACCCGTCC 
GAGGCATACC GTATTGAAAA AAGCGGAGAC GTATTACGTA TTTTTGCACC ATGCAATCAG
ATAAATCACA GGGGTGATAC TTTAGGTGGC CCTGCACTGA CTATTGAGCT GAGTTCCCCG
GCGGAGGATG TAATTCGCAT AAGTGTATAC CACTATAAAG GGGTCAAAAA AGCGGGACCG
TATTTTGAAC TTAATACTCC CGGAATTTCT CCGTCAATCA GTGAAGATGA AGATGCTGTG
TGGTTTGGCA GCGGTCAGAT AAAGGCTCGT ATTGATAAAA AGACCTATAA TATAGACTAT
TATCGCGGCG ATGAACGTCT GACCGGGAGC GGTTGGAGGC ATCTGGCCTA CATAAAGCAT
GAAAGCGGTC AAACCTATAT GAGAGAACAG CTTGATCTGG ACGTGGGGGA GTGTGTATAC
GGTTTGGGAG AGCGATTTAC ACCCTTTGTA AAGAATGGTC AGACAGTTGA TATTTGGAAT
CAGGATGGAG GTACTTGTAC AGAACAATCC TACAAAAATA TTCCATTTTA CATAACAAAC
AGAGGGTATG GAGTTTTTGT AAACGATCCG GGTCTCGTGT CTTTTGAGAT ATGCAGTGAA
GTGGTGTCAC GTTCACAATT CTCAGTTGAA GGCGAATCGT TGGACTATTT TATAATAGGA
GGTTCAGACT GTAAGGAAGT GATTTCAAAC TATACTGCTC TTACAGGACG GCCTGCTATA
CCTCCTGCAT GGTCTTTCGG TCTATGGCTT TCAACTTCAT TTACCACTAA TTACGATGAA
AAAACCGTCA CAAGCTTTAT CAATGGAATG AGTAAGCGGC GTATACCACT ATCAGTATTC
CACTTTGACT GCTTTTGGAT GAAAGAGTTC AACTGGTGTG ATTTTATTTG GGACAAAGAT
GTTTTTCCTG ACCCCAAAAA GATGCTATCC AAGCTAAAGG AAAAGGGCCT GCATATCTGT
GTTTGGATTA ACTCATATGT CTCACAGGAG TCTGTTCTGT TTGATGAAGG GATGCAAAAA
GGATACTTCA TACACAAAAA GAACGGCAGT GTTTGGCAGT GGGATATGTG GCAGCCGGGA
ATGGCCATCG TTGATTTTAC AAATCCCGAT GCGTGTAATT GGTTCTCTCA AAAGCTCCTT
AACCTTGTAG ATATGGGTGT AGATTGCTTC AAAACTGATT TCGGTGAGCG TATCCCTACG
GAGGATGTTG TTTATTACGA TGGTTCAGAC CCCGTGAAGA TGCATAATTT TTACACTTAT
CTATACAACA GGACTGTATT TGATACTTTG AAACAAGTCG GTAAAGAGGC AGTAGTTTTT
GCTCGTTCTG CGACGGCCGG GAGCCAAAAG TTCCCGGTTC ACTGGGGCGG AGATTGTACT
GCTGACTTTT CATCAATGGC TGAAAGCCTG CGGGGAGGAC TGTCACTGGG ATTGTGTGGT
TTCGGTTTCT GGAGTCATGA TATTGGAGGT TTTGAGCAAA CCGCTACAGC TGATGTGTAT
AAACGCTGGC TGGCTTTCGG TATGCTTTCA TCCCACTCAC GTCTACATGG CAGTACCGGC
TATCGTGTAC CTTGGCTATA TGATGAAGAA GCAGTTGATG TACTGCGTTT CTTTGCAAAT
CTTAAATGCC GCTTGATGCC ATATATATAT AAAACTGCGA TTCAGGCCTC GCAGCAGGGT
CTGCCCTCAG CGCGTGCAAT GTTTGTTGAG TTCCCGGAGG ATCCTGCCTG TACAACCCTT
GACCGTCAAT ACATGCTGGG CGATTCCCTG CTGGTTGCAC CTGTGTTTTC AAAATCAGGG
CTGGTAGAGT ATTATTTGCC GTTGGGTGAG TGGTATAACC TTCTGACGGG GGAAATTGTT
ACGGGTGGCA GTTATCGCAA AGAAAAGCAC AATTACATGA GTCTGCCGTT ATTTGTCCGT
CCCGGCAGCC TTCTGGCCAT AGGAGACAAC GAGGAAGAAA CAGTCTATGA ATACGCACAA
GGGGTTCGCC TATTATTGAC GCCCCTGGCT GACGGTTACG AGGCAAGCAC ATCTGTCTAC
GAGAAAGACG GACATGAGGC ACTAAATGTT ACGGTTAACC GTAAAGAAAG TGATATCACT
GTGATTGCCG AGGGAGACGG AAAACCGTGG AGTATAAAAC TGTGTGGTAT CACAGCAAAA
GGATGTACAG GCGGAACTAT TGAGAAGGAA CAGGATGGTA TTGTATTTAT GCCGGGGAAT
TCTACTGGCA GTTATACAGT AAGAATTCTA TAG
 
Protein sequence
MKFLNGYWMS KEGYSLYYPS EAYRIEKSGD VLRIFAPCNQ INHRGDTLGG PALTIELSSP 
AEDVIRISVY HYKGVKKAGP YFELNTPGIS PSISEDEDAV WFGSGQIKAR IDKKTYNIDY
YRGDERLTGS GWRHLAYIKH ESGQTYMREQ LDLDVGECVY GLGERFTPFV KNGQTVDIWN
QDGGTCTEQS YKNIPFYITN RGYGVFVNDP GLVSFEICSE VVSRSQFSVE GESLDYFIIG
GSDCKEVISN YTALTGRPAI PPAWSFGLWL STSFTTNYDE KTVTSFINGM SKRRIPLSVF
HFDCFWMKEF NWCDFIWDKD VFPDPKKMLS KLKEKGLHIC VWINSYVSQE SVLFDEGMQK
GYFIHKKNGS VWQWDMWQPG MAIVDFTNPD ACNWFSQKLL NLVDMGVDCF KTDFGERIPT
EDVVYYDGSD PVKMHNFYTY LYNRTVFDTL KQVGKEAVVF ARSATAGSQK FPVHWGGDCT
ADFSSMAESL RGGLSLGLCG FGFWSHDIGG FEQTATADVY KRWLAFGMLS SHSRLHGSTG
YRVPWLYDEE AVDVLRFFAN LKCRLMPYIY KTAIQASQQG LPSARAMFVE FPEDPACTTL
DRQYMLGDSL LVAPVFSKSG LVEYYLPLGE WYNLLTGEIV TGGSYRKEKH NYMSLPLFVR
PGSLLAIGDN EEETVYEYAQ GVRLLLTPLA DGYEASTSVY EKDGHEALNV TVNRKESDIT
VIAEGDGKPW SIKLCGITAK GCTGGTIEKE QDGIVFMPGN STGSYTVRIL