Gene Caci_4947 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_4947 
Symbol 
ID8336301 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp5648321 
End bp5650255 
Gene Length1935 bp 
Protein Length644 aa 
Translation table11 
GC content70% 
IMG OID644958046 
ProductAlpha-L-fucosidase 
Protein accessionYP_003115648 
Protein GI256394084 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3669] Alpha-L-fucosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00851313 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGAACC CTGTGTCCCG ACGCGCCGTG CTGTCCGGCA CGGCGGCGGC AATAGCCGCC 
ACGAGTCTCA CCGCCGGCGG CGAGAGCCCT GCCGCCGCCG TACAAACCGC TGCCGCGCAA
GCCAATCCGC TGGTACCCCT CCGGATCCCG AAGCTGGACC AGGGCATGTG GCAGCAGCCC
GACGCGACCA TCCAGTGGCT GCGCGACACC CGGCTGGGCA TGTTCATCCA CTGGGGCGTC
TACGCCGGCC CGGCACACGG CGAGTGGTAC ATGCACTCGG CCGCGATCAC TCCGGCTGCC
TACCAGAACT TCGTCACGCA GTCCTCGCCG CAGCAGTTCA CCGCCGACGC CTACAAGCCC
TCCGACTGGG CCGCCCTGGC CAAGCAGCTC GGCGCCGGCT ACACCGTGCT GACCGCCCGA
CACCACGACG GCTTCGCGCT GTGGCCCAGC ACCTACCCCG CCGGCTGGAA CGCCGGACAG
GCACCGCTGA AGCGCGATTT CGTCAAGGAC TACGTCGCTG CGGTCCGCGC GGCAGGTCTG
CGGGTCGGCC TGTACTACTC CCCGATCAAC TGGCGCTACC CCGGCTACTA CGACGTCACC
GGTACCAACT GCGCACCCAA CCCGTGGGGC TACACCACCG ACCCGGCGCA CCACGAGAAC
GCGCGCCTGA TGAAGGAGGA GGTCTATCAG TGCGTCAAGG AACTGGTGAC CCAGTACGGC
GCGCTCGACG ACATCTGGTG GGACGGCGGC TGGCTGGCCG AGCAGGGCAG CGACGCCGAC
GCGTCGTTCT TCTGGGAGCC GGGCCGCTAC CGGGACTCCG GTAACGGCTG GCAGGTGGAC
GCCGCCTACG GCGCCACCGA GACCGGCACC GGACTGCCGC TGGGACTCGC CGGCCTGGTG
CGCCAGAGCC AGCCGACGGC GCTGGCCTCG CCCCGGTCGG GCTGGGTCGG CGACTACGAC
GTCGACGAGG GCGGCTCCGT GCCCGCCGGC CGCCTGCGGT CCGGCCACCT GGTGCAGAAG
ACGTTCAGCG TCGGCAGCAC CTGGGGCTAC AACTCCGACG GCGCCGTGAT GAGCCACGCC
TCGGCCATGG CCTTGCTGGT CAACAGTTTC ATCCGGGACA TGTGTGTGCT GATCAACGTG
GGACCGGACC GCCACGGTAC CGTCCCGGCG AATCAGGCCG CCCTCATGCA ACAGCTGGGC
ACGTTCATGA AAGCCAACGG CGAAGCGGTC TACAACACCC GCGGCGGACC ATGGGACCCG
GTGGACGGCC AGTACGGCTT CACCTTCAGC GGACAGACCG TCTACGTCCA TCTCCTGCCC
GGCTACTCCG GAAGCAGCTT CACCACGCCG GCACTGGGCG ATGCCCGCGC CGTCCGAGCC
TACGACGTGG TCTCACACGC CCCGCTCACC CTCGGTACCG GTTCCGGAAA CCAGGCCACG
ATCAGCGGTA TCGACCGCAC CCGCTACCCG GACGACACCG TCGTCGCGCT CGTCCTGGAC
CGGACGGTCG TCCCCGCCGA CATCGCCCTG GGCCGCACCG CCGGCGCGGA CAGCGTCGAG
ACCGCCCACG GCAACCTCGC CTCCCACGCC GTGGACGGCG ACACCTCCAC CCGCTGGTGT
GCGAGCGACG GTGCCGCCGG CCACTGGCTG ACCGTGGACC TCGGCGGCGT CCACACCGTC
ACCGGGGCGC GCGTCGCCTG GGAATTCGGT GGACACCGGT ACGGGTACCG GATCGACGGC
TCCACCGACG GCACGTCCTG GACGACGCTT TCCGACCAGA GCGCGACCGG CAGCACCTCC
CAAGTCCAGA CCGTCGCCTT CACCGCGGCC ACGCGCCACC TGCGGATCAC CGTCACCGCC
CTGGACGCCG GCTGCTGGGC TTCGATCCGT TCCTTCGAGG TCTACGACCG GGCCTTCTAC
GATCCTTCGC TGTAG
 
Protein sequence
MTNPVSRRAV LSGTAAAIAA TSLTAGGESP AAAVQTAAAQ ANPLVPLRIP KLDQGMWQQP 
DATIQWLRDT RLGMFIHWGV YAGPAHGEWY MHSAAITPAA YQNFVTQSSP QQFTADAYKP
SDWAALAKQL GAGYTVLTAR HHDGFALWPS TYPAGWNAGQ APLKRDFVKD YVAAVRAAGL
RVGLYYSPIN WRYPGYYDVT GTNCAPNPWG YTTDPAHHEN ARLMKEEVYQ CVKELVTQYG
ALDDIWWDGG WLAEQGSDAD ASFFWEPGRY RDSGNGWQVD AAYGATETGT GLPLGLAGLV
RQSQPTALAS PRSGWVGDYD VDEGGSVPAG RLRSGHLVQK TFSVGSTWGY NSDGAVMSHA
SAMALLVNSF IRDMCVLINV GPDRHGTVPA NQAALMQQLG TFMKANGEAV YNTRGGPWDP
VDGQYGFTFS GQTVYVHLLP GYSGSSFTTP ALGDARAVRA YDVVSHAPLT LGTGSGNQAT
ISGIDRTRYP DDTVVALVLD RTVVPADIAL GRTAGADSVE TAHGNLASHA VDGDTSTRWC
ASDGAAGHWL TVDLGGVHTV TGARVAWEFG GHRYGYRIDG STDGTSWTTL SDQSATGSTS
QVQTVAFTAA TRHLRITVTA LDAGCWASIR SFEVYDRAFY DPSL