Gene Cfla_3002 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_3002 
Symbol 
ID9146914 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp3328669 
End bp3331098 
Gene Length2430 bp 
Protein Length809 aa 
Translation table11 
GC content77% 
IMG OID 
ProductAlpha-L-fucosidase 
Protein accessionYP_003638084 
Protein GI296130834 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATCGACG ACGGGGCGGT GACGACGGCG TCCGGGCTGG TGCTGCGGCT CGACGAGCCC 
GCGCGCTGGT GGACCGACGC CTTCCCCGTC GGCAACGGGC GTCTGGGTGC CATGGTCCAC
GGCGGGACGG GTGCCGAGCG GCTGCAGGTC AACGACGACA CCTGCTGGTC GGGCGCCCCG
CACGACGGGA CCGTCGAGCC CGTCGGGCCG CTGGGGCCGG ACGGTGCGCC AGGTGTGGTG
CGGCGTGCAC GCCACCTCCT CGCGGAGGGC GACCCGCTCG CGGCGCAGGA CGAGCTCGCG
AAGCTGCAGA GCGGCTGGGT GCAGGCGTAC CAGCCGCTCG TCGACGTGCT CGTCGAGCAG
CCGGGAGCCG CGGGTCGCGA CGACTACCGC CGCGTGCTCG ACCTCGCGCG CGGCGTGGTG
ACGACGACGT GGCGCTCGGC CGCCGGCGAG CCGTGGCGGC AGGAGGTGCT CGTCAGCCAC
CCGGACGGCG CGCTGCTGCT GGAGCGGGCC GGGGCGCCGG GGGAGACGCG CGTGCGGCTG
GCGTCCCCGC ACCCGTGGGC GTCGACGCCG GCGGCCGCGG GCGACGGGAT CCTCGTCGCC
ACCCTCGACA TGCCCTCGCA CGTGCTGCCC GACTGGGTCG ACGGCCCTGA CCCCGTGCAG
TACGGCGGGC GGTCCGTGCA CGCCGCGGTG GCGCTCGCGG TGCTGGCCGA CGACGCCCCG
GTCGCGGTCG TCGACGGCGA GGTGCGGGTC ACCGGGGCAC GCCGCGTGCG CGTCGTCCTG
ACGTCCGCGA CCGACCACGA CGTCGCGACC GGCACGCTGC ACGGCGACCG GGAGCGCGTG
GCTGCCGACG CGCTCGCGGG CCTGCGGGGC GCGCTCGCGG ACGTCGACGG CATCCCTGCC
CGGCACGTCG CGGACCACGC CGCGCTCCTG GGGCGCGTGT CGCTCGACCT GGTCGCCGCG
CCGCCCGACC TGCCGCTCGA CGCGCGGCTC GCCCGCCACG CGGCCGGCGA GCCGGACGCG
CACCTGGCGG TCCTGGCGTT CCAGCTCGGC CGGTACCTCA CGGTCGCGGG CTCGCGGCCC
GGCACGCTGC CGCTCAACCT CCAGGGCATC TGGAACGAGC GGGTCCGCCC GCCGTGGAGC
TCGAACTACA CGATCAACAT CAACACCGAG ATGAACTACT GGCCCGCCCT CGTCGGCGAC
CTCGCGGAGT GCCACGAGCC GCTGCTGTCC TGGCTCGACC GGCTCGCCGC CGCGGGACGG
CAGACCGCCC GGACGCTGTA CGGCGCACGC GGCTGGGTGG CGCACCACAA CTCCGACCCG
TGGTGCTTCA CGGGCCCGAC GGGCCGCGGC CACGACTCCG CGTCGTGGTC GGCCTGGCCG
CTGGGCGGTG CGTGGCTGGC CCGGCACGTC GTCGACCACC ACGACTGGAC GGGCGACGAC
GACGCGCTGC GCCGGCACTG GCCGGTCGTG CGCGACGCCG CCCGCGCGGT GCTCGACCTG
CTCGTCGAGC TGCCCGACGG CACGCTCGGC ACGTCGCCGG GGACGAGCCC CGAGAACCAC
TACCTGCTGC CCGACGGCCG CCCCGCGGCG GTGGCGGTGT CGACCACGGC GGACCTCGCG
ATCGTGCGCG ACCTGCTCGA GCAGGTACGG CGTCTCGCGC CCGTCGTGCG GGACCGCGAC
GAGGACCTGC GCGCCGCCGT CGACGGCGCG CTCGAGCGGC TGCCCACCGA GCGCGTCGCC
CCCGACGGCC GGCTCGCCGA GTGGCACGAG GACGTGCCCG ACGCCGAGCC CGAGCACCGC
CACCAGTCGC ACCTGTACCG GGTGTTCCCC GGCACGTCGA TCGACCCGGA CACCACGCCC
GAGCTCGCGG CCGCGGCCCG TCGCACGCTC GACGCGCGCG GCCCGGAGTC GACCGGCTGG
TCGCTCGCGT GGCGGCTCGC GCTGCGCGCG CGGCTGCGCG ACCCCGAGGG CGTCGCGGCG
CTGGTGAGCG CGTTCCTGCA CCCCGTCCCC GGTGAGGAGC CCGCGTCCTG GCCGGCGCCC
GGCGGGGTCT ACCGCTCGCT GCTGTGCGCG CACCCGCCGT TCCAGGTCGA CGGCAACCTC
GGCTTCACCG CGGGCGTCGT CGAGGCGCTC GTGCAGGCGC ACCACCGCGG CCCCGACGGC
GTGCGTGAGG TGCACCTGCT GCCCGCGCTG CCGGCGTCCT GGCCGGAGGG ACGCGTGCAG
GGGCTGCGGC TGCGCGGCGG CGTGGACCTC GTCGACCTGC GGTGGGCCGA GGGTCGCGTC
GTGCTGGCCG AGCTCGCGGC CAAGCGGGAC GTTGTGGTCG ACGTGCGCGA GCGCGGCGGA
ACGGAGCGGG CGCAGGTGAC GCTGCGGCCG GGCAGGCCGG TGGTCATCGC AGGGTCGGGC
CGGGGATCGC TGTCGTGTGG CGCGTCGTGA
 
Protein sequence
MIDDGAVTTA SGLVLRLDEP ARWWTDAFPV GNGRLGAMVH GGTGAERLQV NDDTCWSGAP 
HDGTVEPVGP LGPDGAPGVV RRARHLLAEG DPLAAQDELA KLQSGWVQAY QPLVDVLVEQ
PGAAGRDDYR RVLDLARGVV TTTWRSAAGE PWRQEVLVSH PDGALLLERA GAPGETRVRL
ASPHPWASTP AAAGDGILVA TLDMPSHVLP DWVDGPDPVQ YGGRSVHAAV ALAVLADDAP
VAVVDGEVRV TGARRVRVVL TSATDHDVAT GTLHGDRERV AADALAGLRG ALADVDGIPA
RHVADHAALL GRVSLDLVAA PPDLPLDARL ARHAAGEPDA HLAVLAFQLG RYLTVAGSRP
GTLPLNLQGI WNERVRPPWS SNYTININTE MNYWPALVGD LAECHEPLLS WLDRLAAAGR
QTARTLYGAR GWVAHHNSDP WCFTGPTGRG HDSASWSAWP LGGAWLARHV VDHHDWTGDD
DALRRHWPVV RDAARAVLDL LVELPDGTLG TSPGTSPENH YLLPDGRPAA VAVSTTADLA
IVRDLLEQVR RLAPVVRDRD EDLRAAVDGA LERLPTERVA PDGRLAEWHE DVPDAEPEHR
HQSHLYRVFP GTSIDPDTTP ELAAAARRTL DARGPESTGW SLAWRLALRA RLRDPEGVAA
LVSAFLHPVP GEEPASWPAP GGVYRSLLCA HPPFQVDGNL GFTAGVVEAL VQAHHRGPDG
VREVHLLPAL PASWPEGRVQ GLRLRGGVDL VDLRWAEGRV VLAELAAKRD VVVDVRERGG
TERAQVTLRP GRPVVIAGSG RGSLSCGAS