Gene Caci_4683 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_4683 
Symbol 
ID8336037 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp5333975 
End bp5336290 
Gene Length2316 bp 
Protein Length771 aa 
Translation table11 
GC content70% 
IMG OID644957783 
Productalpha-xylosidase YicI 
Protein accessionYP_003115385 
Protein GI256393821 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1501] Alpha-glucosidases, family 31 of glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00144447 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.975878 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAGTTCA CCGATGGCTA CTGGCTGATG CGGCCCGGCG TCCAGGCGGT CTATCCCGCC 
CAGGTGCTCG ACGGGCAGAT CGGTCCGGAC TCGCTGGTCG TCCACGCCCC CTGCTACCGC
TTCCACGACC GCGACGACCT GTTGCGCGGC CCGGCGCTCA CCGTCGCGTT CAGCGCGCCG
ATACCCGACG TGATCCGGGT CGCCGTCACG CACTTCGCCG GGCAGGACCG CAAGCCTGAT
TTCGTGCTGA TGGACGGCGA ATCGGAGCTC CCGGTGATCG CCGAGGACGA CGACGCGCTG
ACCTTCACCT CCGGCGCTCT GACCGCCCGC ATCGCCAAGG GCCCGCGTTG GAGCCTGGAC
TTTCTGGCCG AGGGCAGGCG GCTGACCGGC AGCGGCGCCA AGGCGATGGC CGCGATCGAC
ACCGCCGACG GCGAGCACTT CGTCCGTGAG CAGTTGGACC TGGCGATCGA CGCCTTCGTC
TACGGCCTCG GCGAGCGCTT CGGCCCGCTG GTGAAGAACG GGCAGTCCGT CGACAGCTGG
AACGCCGACG GCGGCACGGC CAGCGAGCAG GCGTACAAGA ACGTCCCGTT CTTCCTGACC
AACGCCGGCT ACGGGGTGTT CGTCAACCAT CCCGGCCGGG TCTCCTTCGA GGTCGCCTCC
GAGGCGGTGG CCCGGGCCCA GTTCAGCGTC GAGGGTCAGG AGCTGGAGTA CTTCCTCATC
TACGGGCCCA CGCCCCGGGA GATCCTGAGC AAGTACACCG CGCTGACCGG ACGTCCGCCG
CGCGTTCCGG CGTGGTCGTA CGGGCTGTGG CTGTCCACGT CGTTCACCAC CGACTACGAC
GAGGCGACGG TCGGGGGGTT CATCGACGAG ATGGTGCGGC GCGATCTGCC GCTGTCGGTG
TTCCACTTCG ACTCGTTCTG GATGCGCGAG TTCAACTGGT GCGACTTCGA GTGGGACCCG
CGCACGTTCC CCGATCCGCG CGGCATGCTG GAGCGGCTGA AGGCCCGCGG CTTGCGGATA
TGCGTGTGGA TCAACCCTTA TATAGCGCAG CGCTCGCCGC TGTTCGCCGA GGCGAAGGCC
GCCGGGTACC TGCTCAAGCG CGCCAACGGC GACGTCTGGC AGTGGGACCT GTGGCAGCCG
GGGCTGGCGG TCGTCGACTT CACCAACCCC GAGGCCCGGC AGTGGTACGC GGGCAAGCTG
GACGCGCTGG TCGAGATGGG CGTGGACTGC TTCAAGTCCG ACTTCGGCGA GCGCATCCCG
ACCGACGTGG TCTACGCCGA CGGCTCCGAT CCCGAGCGCG TGCACAACCT CTACGCCTAC
TACTACAACC AGACCGTCTT CGAGCTGCTG CGCAAGCGGC GCGGCGAGGG CGAGGCGCTG
GTGTTCGCCC GCTCGGCGAC CGTCGGCTCG CAGCAGTTCC CGGTGCACTG GGGCGGGGAC
TGCGAGTCCA CGTTCGAGGC GATGGCTGAG AGCCTGCGCG GCGGGCTGTC GCTGGGTCTG
TCCGGGTTCG GCTACTGGAG CCACGACATC GGCGGCTTCG AGGGGACGCC GGATCCGGCG
GTGTTCAAGC GCTGGATCGC CTTCGGGCTG CTGTCCTCGC ACAGCCGGCT GCACGGCCAC
GAGTCCTACC GCGTCCCCTG GCTGTTCGAC GAGGAAGCGG TGGACGTGCT GCGGAGCTTC
ACCAAGCTCA AGGCGCGCCT GATGCCCTAC CTGCTCAAGA GCGCGGAGCA GGTCGTCGCC
GGCGGTGTGC CGGTGATGCG GGCGATGGTG CTGGAGTTCC CAGACGATCC GGCGTGCACG
CACCTGGAGC GGCAGTACAT GCTCGGCGAC GACCTGCTGG TCGCGCCGGT GTTCACCGCC
GACGGAAGCG TGCGGTACTA CGTGCCGGAG GGGACGTGGA CGCACCTTGT GACGGGGGAG
AAGGTAGTCG GGCCGCGCTG GGCCCGCGAA CAGCACGGGT TCGACAGCGT CCCGCTGTTG
GCCCGGCCGG GGTCGGTGAT CCCGATCGGC GCGGTCGAGG ACCGGCCGGA GTACGACTAC
GCCGCCGGCG TCACGCTGCG GGTGTCCGAA CTCGGCGACG GCGCGGAAGT TTCAACGGTC
GTCCCGGCGG CCGACGGCTC CGTGCTCGCG ACGTTCACCA CGACCAGGAC CGGCCGCGAA
ATCCGCGTCA CCTCCTCCGG TACCGTGAAC GGCTGGAGGG TGCAGCTCTC CGGGGTCGGC
GCCGTGCGCG CCGAGGGCGG TGCGGTGACG CCGGATCCGC TCGGCGCCGT CGTCCGCGCC
GAGACCGGCA CCGTGGTGCT GACGCTTGAA GACTGA
 
Protein sequence
MKFTDGYWLM RPGVQAVYPA QVLDGQIGPD SLVVHAPCYR FHDRDDLLRG PALTVAFSAP 
IPDVIRVAVT HFAGQDRKPD FVLMDGESEL PVIAEDDDAL TFTSGALTAR IAKGPRWSLD
FLAEGRRLTG SGAKAMAAID TADGEHFVRE QLDLAIDAFV YGLGERFGPL VKNGQSVDSW
NADGGTASEQ AYKNVPFFLT NAGYGVFVNH PGRVSFEVAS EAVARAQFSV EGQELEYFLI
YGPTPREILS KYTALTGRPP RVPAWSYGLW LSTSFTTDYD EATVGGFIDE MVRRDLPLSV
FHFDSFWMRE FNWCDFEWDP RTFPDPRGML ERLKARGLRI CVWINPYIAQ RSPLFAEAKA
AGYLLKRANG DVWQWDLWQP GLAVVDFTNP EARQWYAGKL DALVEMGVDC FKSDFGERIP
TDVVYADGSD PERVHNLYAY YYNQTVFELL RKRRGEGEAL VFARSATVGS QQFPVHWGGD
CESTFEAMAE SLRGGLSLGL SGFGYWSHDI GGFEGTPDPA VFKRWIAFGL LSSHSRLHGH
ESYRVPWLFD EEAVDVLRSF TKLKARLMPY LLKSAEQVVA GGVPVMRAMV LEFPDDPACT
HLERQYMLGD DLLVAPVFTA DGSVRYYVPE GTWTHLVTGE KVVGPRWARE QHGFDSVPLL
ARPGSVIPIG AVEDRPEYDY AAGVTLRVSE LGDGAEVSTV VPAADGSVLA TFTTTRTGRE
IRVTSSGTVN GWRVQLSGVG AVRAEGGAVT PDPLGAVVRA ETGTVVLTLE D