Gene Caci_5221 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_5221 
Symbol 
ID8336575 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp6004394 
End bp6008467 
Gene Length4074 bp 
Protein Length1357 aa 
Translation table11 
GC content69% 
IMG OID644958319 
Productglycoside hydrolase family 3 domain protein 
Protein accessionYP_003115921 
Protein GI256394357 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0332695 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGACGAA CCACCCCCCA CCGCCGCAAG CTGATAGCCG CAGGATCAGC CCTCGCCCTG 
GTCTGCGCGC TCGCCGCGTC CGCGACCAGC GTCGCGGCCT CCGCCAAGGA CACCGCCGCC
CCGGCGGCCA GCAGCACGCC GATCTACCTG AACACCAGCT ATCCCTTCGA GGCGCGCGCC
GCGGACCTGG TCTCGCGCAT GACGCTGGCG GAGAAGGCGG CGCAGCTGAA CACCACCAGC
GCGCCGGCGA TCCCGCGCCT GGGCGTGCAG CAGTACACGT ACCAGGCCGA GGCCCAGCAC
GGCATCAACT ACCTGGGCGG CGACCAGAAC AGCGGCAGCG TCGCGGGCAA CCCGCCGGTC
GCGACCAGCT TCCCGACGAA CTTCGCCTCC TCGATGTCCT GGGATCCGGC GCTGGTCTAC
CAGGAGACGA CCGCGGTCTC CGACGAGGCG CGCGGCTTGG TGGACAAGTC GCTGTTCGGC
ACCGGACAGA ACAACCTCGG TCCCTCGGCG AGCGACTACG GCTCGCTGAC GTTCTGGGCG
CCGACGGTCA ACCTGGACCG GGACCCGCGC TGGGGTCGCA CCGACGAGGC GTTCGGTGAG
GACCCGTACC TGGTCGGGCA GATGGCCGGG GCGTTCGTCA ACGGCTTCCA GGGCAACTCG
ATGACCGGGC AGTCGCTGGA CGGCTACCTC AAGGCCGCCG CGACCGCCAA GCACTACGCG
CTGAACGACG TGGAGCAGAA CCGCACCGGC ATCTCCTCCA ACGTCAGCGA CACCGACCTG
CGCGACTACT ACACCAAGCA GTTCGCGGAC CTGATCGAGA ACTCGCATGT CGCGGGTCTG
ATGACCTCCT ACAACGCGAT CAACGGCACG CCCTCGGTCG CCGACACCTA CACCGCCAAC
CAGCTCGCGC AGCGCACGTA CGGCTTCAAC GGCTACGTGA CCTCCGACTG CGGCGCGGTC
GGCACGGCCT ACCGCAACTT CCCCGCCGGC CACGCCTGGG CTCCCCCGGG CTGGACCACC
GACGGCGGCG ACACGAACTC GATCTGGACC AACACCTCCA CCGGCGCGAA GATCTCCGGC
GCGGCCGGCG GCGAGGCGTA CTCGCTGCGC GCCGGCACGC AGGTGAACTG CGGCGGCGAC
GAGTTCTCGC TGCAGAACAT CCAGGCGGCG ATCAGCGCCG GGATCCTGTC GGAGGGCGTC
ATCGACTCCG ACCTGACCAA GCTGTTCACG ATCCGCATGG AGACCGGCGA GTTCGACCCG
GCTTCGAAGG TTCCTTACAC CAGCATCACC AAGGCGCAGA TCCAGAGCCC GGCGCACCAG
GCGCTGGCCA CCAGCGTCGC GGACAACTCG CTGGTCCTGC TGAAGAACGC CAATGTCTCC
GGTACCAGCG CGCCGCTGCT GCCGGCCAGC GCGAGCAAGC TGGCCAACGT CGTCATCCTC
GGCGACATGG CCAACCAGGT GACCCTCGGC GACTACTCCG GCGCGCCGTC GCTGCAGGTG
AACGCGGTGC AGGGGTTGAC CACCGCGATC AAGGCGGCGA ACCCGAGCGC GAACATCCTC
TTCGACGCCG CCGGGACCTC CAGCACCACG ACCTCGGCGG CGACGCTGAG CAGCGCGACG
CAGGCCGCGA TCAAGAAGGC CGACCTGGTC GTGATGTTCG TCGGCACCAA CCAGAACAAC
GCCCAGGAGG GCAACGACCG CACCACGCTG AACATGCCGG GCAACTACGA CTCGCTGATC
ACCCAGACCA CGGCGCTGGG CAACCCGAAG ACCGCGCTGG TCGTGCAGTC CGACGGCCCG
GTGAAGATCA GCGACGTGCA GGGCAGCGTG CCGGCGGTCG TGTTCAGCGG CTACAACGGC
GAGAGCCAGG GCACGGCGCT GGCCGACGTG CTGCTCGGCA AGCAGAACCC GAGCGGGCAC
CTGAACTTCA CCTGGTACGC CGACGACTCG CAGCTTCCGG CGATGTCGAA CTACGGTCTG
ACCCCGGGCG ACACCAGCGG GCTCGGCCGG ACCTACCAGT ACTTCACCGG CACGCCGACC
TACCCGTTCG GCTACGGCCT GAGCTACTCG GCCTTCACCT ACTCCGCCGC GACCGTCGAC
AACGCCAGCC CGAACGCTGA CGGCACGGTC AACGTCAGCT TCAAGGTCAC CAACTCCGGC
AGCACCGCGG GGGCGACCGT GGCGCAGCTG TACGCCGCGA CGCAGTTCAC GGAGTCCGGG
GTGCAGCTGC CGACCAAGCG GCTGGTGGGC TTCCAGAAGA CCGGCGTCCT GAACCCCGGC
GCCGCGCAGC AGATCACGAT TCCGGTGAAG ATCAGCGACC TGTCGTTCTG GAACGCCACG
ACGATGAAGT CCGTGGTCTA CGACGGCACG TACGCCCTGC AGGTCGGCGC CAGCGCCTCG
GACATCCGGA CCTCGGTGAA CGTCGCGGTC TCCGGCGCCA TCACGCCGAA GGTGCAGACC
GTGACCGTGC AGCCGGAGAG CGTGGTCTAC AACGCCGGCT CGACCATCGA CCTGACCGGC
AAGAACCAGT GGATCAAGGA CGACACCACC GGCGTCGGCT CGGTCGCGCA GGGCCGGAAC
ATGAGCGTGA CGGCGGACAA CGTCATCGAG GCCGTCAACA ACGACCAGTC GTTCGTGAAC
CTGGCGGGCG CCTCGGTCAG CTACAGCAGC AGCGACCCGA CGGTCGCGAC CGTCACCAGC
ACCGGGCAGG TGCACGCGGT CGGCGACGGC ACGGCGCTGA TCAGCGTCAC GGTCAACGGC
GTCACCGGCA CCGCGCCGAT CGTGGTCCGG CACACGCTGA GCCTGGCCGC GCCGACGCTG
ATCACCGCGG GCGGCAGCGG CACGGCGACC ACGACGTTCG TCAACGGCGG CACCGCTGCC
GAGAGCAACG TCAATGTCGC ACTGACTCTG CCTTCGGGCT GGAGCGCGCA GGCCACCACG
CCTTCGTCGT TCGCCAGCGT CGCCGGCGGG CAGTCGGTGC AGACCACGTG GAAGGTGAAC
GCGCCGTCGG GCACGGCGGC GGGACCGTAC GCGGTGTCGG CGCAGGCGAC GTTGACCGGC
AGCGGTCCGT ACAGCGACTC CGGCACGATG AACATCGCCT ACCCGTCGCT GAGCGCCGCG
TTCAACAACG TCGGCACCAC CGACGACGCC AGCACGGCGG CCGGCAACCT CGACGGCGGC
GGGACGAGCT ATTCGGCCCA GGCGCTGTCC GCGGCTGCCG GGATCGCGCC TGGAGCCACG
CTCACCCACG ACGGCGCGAC GATCGTCTGG CCGAGTGCGG CGTCCGGGAC CAAGAACGAC
ATCGTGGCCT CCGGGCAGAC GGTTCCGGTG TCCGGCTCGG GGACGACGCT GTCTATCGTG
GGCACGTCGA CGTACGGGTC CTCCTCCGGC AGCGGGACCA TCATCTACAC CGACGGCAGC
ACCCAGAATT ACAGCCTGGG CTTCGCCGAC TGGTGGTCGA CGTCGGCGGC TCAGGGGACT
GACTTCCTGG CGAAGCCGAC CTACATCAAC GGCGGCAACG GCAAGATCAC GCAGGCGGTG
AACCTCTCGT ACGCGGCGAT CCCGTTGCAG GCGGGCAAGA CGGTGCAGGC CGTCGTGCTG
CCGAACGTCA GCGCGAGCGC CGTGAGCGGT TCGGTCTCGA TGCACATCTT CGCCGTGTCG
GTGTCCGGCA CCTCGGCGGG TTCGGTCGTC AGCCTGCGCT CGCACGCGAA CAACGACATC
GTGACCGCGG ACAACGGCGG CACCTCGCCG CTGATCGCGA ACCGGACCTC GGTCGGGCAG
TGGGAGTCGT TCGACCTGAT CACGAACTCC GACGGCAGCG TGAGCCTCCG GTCGCACGCG
AACAACGACA TCGTGACCGC CGACAACGCC GGTGCGGCGC CGCTGATCGC CAACCGGACC
TCGATCGGGC CGTGGGAGGA GTTCGACCTG ATCCACAACT CCGACGGCAG CGTCAGCTTC
CGGTCGCACG CGAACAACGA CATCGTGACC GCCGACAACG CCGGCGCGGC GCCGCTGATC
GCTAACCGGA CCGCGGTCGG CCCGTGGGAG GAGTTCGATC TGATCCACGA CTGA
 
Protein sequence
MRRTTPHRRK LIAAGSALAL VCALAASATS VAASAKDTAA PAASSTPIYL NTSYPFEARA 
ADLVSRMTLA EKAAQLNTTS APAIPRLGVQ QYTYQAEAQH GINYLGGDQN SGSVAGNPPV
ATSFPTNFAS SMSWDPALVY QETTAVSDEA RGLVDKSLFG TGQNNLGPSA SDYGSLTFWA
PTVNLDRDPR WGRTDEAFGE DPYLVGQMAG AFVNGFQGNS MTGQSLDGYL KAAATAKHYA
LNDVEQNRTG ISSNVSDTDL RDYYTKQFAD LIENSHVAGL MTSYNAINGT PSVADTYTAN
QLAQRTYGFN GYVTSDCGAV GTAYRNFPAG HAWAPPGWTT DGGDTNSIWT NTSTGAKISG
AAGGEAYSLR AGTQVNCGGD EFSLQNIQAA ISAGILSEGV IDSDLTKLFT IRMETGEFDP
ASKVPYTSIT KAQIQSPAHQ ALATSVADNS LVLLKNANVS GTSAPLLPAS ASKLANVVIL
GDMANQVTLG DYSGAPSLQV NAVQGLTTAI KAANPSANIL FDAAGTSSTT TSAATLSSAT
QAAIKKADLV VMFVGTNQNN AQEGNDRTTL NMPGNYDSLI TQTTALGNPK TALVVQSDGP
VKISDVQGSV PAVVFSGYNG ESQGTALADV LLGKQNPSGH LNFTWYADDS QLPAMSNYGL
TPGDTSGLGR TYQYFTGTPT YPFGYGLSYS AFTYSAATVD NASPNADGTV NVSFKVTNSG
STAGATVAQL YAATQFTESG VQLPTKRLVG FQKTGVLNPG AAQQITIPVK ISDLSFWNAT
TMKSVVYDGT YALQVGASAS DIRTSVNVAV SGAITPKVQT VTVQPESVVY NAGSTIDLTG
KNQWIKDDTT GVGSVAQGRN MSVTADNVIE AVNNDQSFVN LAGASVSYSS SDPTVATVTS
TGQVHAVGDG TALISVTVNG VTGTAPIVVR HTLSLAAPTL ITAGGSGTAT TTFVNGGTAA
ESNVNVALTL PSGWSAQATT PSSFASVAGG QSVQTTWKVN APSGTAAGPY AVSAQATLTG
SGPYSDSGTM NIAYPSLSAA FNNVGTTDDA STAAGNLDGG GTSYSAQALS AAAGIAPGAT
LTHDGATIVW PSAASGTKND IVASGQTVPV SGSGTTLSIV GTSTYGSSSG SGTIIYTDGS
TQNYSLGFAD WWSTSAAQGT DFLAKPTYIN GGNGKITQAV NLSYAAIPLQ AGKTVQAVVL
PNVSASAVSG SVSMHIFAVS VSGTSAGSVV SLRSHANNDI VTADNGGTSP LIANRTSVGQ
WESFDLITNS DGSVSLRSHA NNDIVTADNA GAAPLIANRT SIGPWEEFDL IHNSDGSVSF
RSHANNDIVT ADNAGAAPLI ANRTAVGPWE EFDLIHD