Gene CPF_1048 details


Gene Information

Locus tag: CPF_1048
Symbol: fucI
ID: 4202763
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens ATCC 13124
Kingdom: Bacteria
Replicon accession: NC_008261
Strand:
Start bp: 1195542
End bp: 1197335
Gene Length: 1794 bp
Protein Length: 597 aa
Translation table: 11
GC content: 35%
IMG OID: 638081929
Product: L-fucose isomerase
Protein accession: YP_695494
Protein GI: 110800239
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2407] L-fucose isomerase and related proteins
TIGRFAM ID: [TIGR01089] L-fucose isomerase
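
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes one terminal stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp, has_stop=True):
    """Number of amino acids encoded by a CDS of the given length.

    Assumes the CDS is a whole number of codons and, when has_stop is
    True, that the final codon is a stop codon that is not translated.
    """
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if has_stop else codons


length = gene_length(1195542, 1197335)
print(length)                           # 1794
print(expected_protein_length(length))  # 597
```

Both results agree with the Gene Length (1794 bp) and Protein Length (597 aa) fields of this record.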


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAAAAA GTAGATTAGT AGGAAAGTAT CCTGTAATAG GAATAAGACC AACTATTGAT 
GGAAGAAGAG GAATAATAGA TGTAAGAGGT TCTCTTGAAG AACAAACAAT GAATATGGCA
AAGTCAGCAG CAAAGCTTTT AGAGGAAAAT TTAAAATATT CAAATGGAGA AAAAGTAAAG
GTTATAATAG CTGACACTAC AATTGGAAGA GTTCCAGAAG CTGCAGCTTG TGCAGATAAA
TTTAGAAGAG AAGGTGTAGA TATAACACTT ACAGTTACTC CATGTTGGTG CTATGGTGCA
GAAACAATGG ATATGGATCC AATGACTATA AAAGGGGTAT GGGGATTTAA TGGAACTGAA
AGACCAGGAG CTGTTTATTT AGCATCAGTT TTAGCAACTC ATGCTCAAAA GGGACTTCCT
GCCTTTGGAA TATATGGACA TGATGTTCAA AATGCAGATG ATACTGAAAT ACCAGAAGAC
GTAAAAGAAA AGATATTAAG ATTTGGAAGA AGTGCAATTG CAGCAGCATC TATGAGGGGA
AAATCTTATC TTCAAATTGG TTCAATATGT ATGGGAATAG GTGGATCTAT TATAGATCCA
AACTTTATAG AAGAATATTT AGGTATGAGA GTAGAATCTG TTGATGAAGT AGAAATTATA
AGAAGAATGA CAGAAGAAAT ATATGATAAA GATGAATTTG AAAGAGCTTT AAAATGGACT
AAGGAAAAAT GTAAGGAAGG TTTTGATAAA AATCCTGAGA ATGTTCAAAA AACAAGAGAA
GAAAAGGATA AGGATTGGGA ATTTGTAGTT AAAATGATGT GCATTATAAA GGATTTAATG
AATGGAAATG AAAATTTACC AGATGGATTT GAAGAAGAAA AATTAGGACA TAATGCAATA
GCGGCAGGTT TCCAAGGACA AAGACAATGG ACTGATTTTT ATCCAAATTG TGATTTCCCA
GAAGCACTAC TTAATACTTC ATTTGACTGG AATGGAGCTA GGGAGCCTTA TATATTAGCT
ACTGAAAATG ATGTTTTAAA TGGATTAGGT ATGCTTTTTG GAAAGCTACT TACAAATAAA
GCACAAATAT TTGCAGATGT TAGAACTTAT TGGAGTCCTG ATGCCGTTAA GAAAGCTACA
GGATATGAAT TAGAAGGGGT TGCAAAGGAA TCAGATGGAT TTATACATTT AATAAATTCA
GGCGCAGCTT GTCTTGATGC ATGTGGACAA GCTAAAGATG AAAATGGAAA TGGAACAATG
AAGGCTTGGT ATGATGTTAC AGAAGAAGAC CAAGAAGCAA TCCTTGCTGC AACTACATGG
AATGCTGCCG ATAATGGATA CTTTAGAGGT GGTGGATATT CATCAAGATT CTTAACAGAA
GCTGAAATGC CAGTAACAAT GATACGTTTA AATCTTGTGA AAGGCCTTGG TCCAGTTGTT
CAATTAGTTG AAGGATATTC AGTAAAACTT CCAGATGAAG TATCAGATAA ATTATGGAAA
AGAACAGATT ATACTTGGCC TTGTACTTGG TTTGCACCAA GACTTACAGG AAAAGGAGCA
TTTAAATCTG CTTATGATGT AATGAATAAT TGGGGTGCTA ACCATGGAGC TATAAGTCAT
GGACATATAG GGGCAGATAT AATTACTTTA TGTTCTATCT TAAGAATACC TGTAAGTATG
CATAATGTTC CAGAGGAAAA AATATTTAGA CCAGCAGCTT GGAATGCCTT TGGAATGGAT
AAAGAGGGAC AAGATTATAG AGCTTGTAAG GCTTATGGAC CAATGTATAA ATAA
 
Protein sequence
MAKSRLVGKY PVIGIRPTID GRRGIIDVRG SLEEQTMNMA KSAAKLLEEN LKYSNGEKVK 
VIIADTTIGR VPEAAACADK FRREGVDITL TVTPCWCYGA ETMDMDPMTI KGVWGFNGTE
RPGAVYLASV LATHAQKGLP AFGIYGHDVQ NADDTEIPED VKEKILRFGR SAIAAASMRG
KSYLQIGSIC MGIGGSIIDP NFIEEYLGMR VESVDEVEII RRMTEEIYDK DEFERALKWT
KEKCKEGFDK NPENVQKTRE EKDKDWEFVV KMMCIIKDLM NGNENLPDGF EEEKLGHNAI
AAGFQGQRQW TDFYPNCDFP EALLNTSFDW NGAREPYILA TENDVLNGLG MLFGKLLTNK
AQIFADVRTY WSPDAVKKAT GYELEGVAKE SDGFIHLINS GAACLDACGQ AKDENGNGTM
KAWYDVTEED QEAILAATTW NAADNGYFRG GGYSSRFLTE AEMPVTMIRL NLVKGLGPVV
QLVEGYSVKL PDEVSDKLWK RTDYTWPCTW FAPRLTGKGA FKSAYDVMNN WGANHGAISH
GHIGADIITL CSILRIPVSM HNVPEEKIFR PAAWNAFGMD KEGQDYRACK AYGPMYK
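
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a minimal sketch, not the tool used to produce this record: it builds the codon-to-amino-acid mapping shared by NCBI translation tables 1 and 11 (the two tables differ in permitted start codons, not in amino acid assignments), and the helper name `translate` is hypothetical. Only the first and last few codons of the gene are used here to keep the example short.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna):
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is
    ignored. Trailing bases that do not form a full codon are dropped.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGGCAAAAA GTAGATTAGT AGGAAAGTAT"))  # MAKSRLVGKY
# Last 24 bases of the gene sequence above, ending in the TAA stop codon.
print(translate("GCTTATGGAC CAATGTATAA ATAA"))        # AYGPMYK
```

The two outputs match the first ten and last seven residues of the protein sequence listed above.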