Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_5221 |
Symbol | |
ID | 8336575 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 6004394 |
End bp | 6008467 |
Gene Length | 4074 bp |
Protein Length | 1357 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 644958319 |
Product | glycoside hydrolase family 3 domain protein |
Protein accession | YP_003115921 |
Protein GI | 256394357 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0332695 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGACGAA CCACCCCCCA CCGCCGCAAG CTGATAGCCG CAGGATCAGC CCTCGCCCTG GTCTGCGCGC TCGCCGCGTC CGCGACCAGC GTCGCGGCCT CCGCCAAGGA CACCGCCGCC CCGGCGGCCA GCAGCACGCC GATCTACCTG AACACCAGCT ATCCCTTCGA GGCGCGCGCC GCGGACCTGG TCTCGCGCAT GACGCTGGCG GAGAAGGCGG CGCAGCTGAA CACCACCAGC GCGCCGGCGA TCCCGCGCCT GGGCGTGCAG CAGTACACGT ACCAGGCCGA GGCCCAGCAC GGCATCAACT ACCTGGGCGG CGACCAGAAC AGCGGCAGCG TCGCGGGCAA CCCGCCGGTC GCGACCAGCT TCCCGACGAA CTTCGCCTCC TCGATGTCCT GGGATCCGGC GCTGGTCTAC CAGGAGACGA CCGCGGTCTC CGACGAGGCG CGCGGCTTGG TGGACAAGTC GCTGTTCGGC ACCGGACAGA ACAACCTCGG TCCCTCGGCG AGCGACTACG GCTCGCTGAC GTTCTGGGCG CCGACGGTCA ACCTGGACCG GGACCCGCGC TGGGGTCGCA CCGACGAGGC GTTCGGTGAG GACCCGTACC TGGTCGGGCA GATGGCCGGG GCGTTCGTCA ACGGCTTCCA GGGCAACTCG ATGACCGGGC AGTCGCTGGA CGGCTACCTC AAGGCCGCCG CGACCGCCAA GCACTACGCG CTGAACGACG TGGAGCAGAA CCGCACCGGC ATCTCCTCCA ACGTCAGCGA CACCGACCTG CGCGACTACT ACACCAAGCA GTTCGCGGAC CTGATCGAGA ACTCGCATGT CGCGGGTCTG ATGACCTCCT ACAACGCGAT CAACGGCACG CCCTCGGTCG CCGACACCTA CACCGCCAAC CAGCTCGCGC AGCGCACGTA CGGCTTCAAC GGCTACGTGA CCTCCGACTG CGGCGCGGTC GGCACGGCCT ACCGCAACTT CCCCGCCGGC CACGCCTGGG CTCCCCCGGG CTGGACCACC GACGGCGGCG ACACGAACTC GATCTGGACC AACACCTCCA CCGGCGCGAA GATCTCCGGC GCGGCCGGCG GCGAGGCGTA CTCGCTGCGC GCCGGCACGC AGGTGAACTG CGGCGGCGAC GAGTTCTCGC TGCAGAACAT CCAGGCGGCG ATCAGCGCCG GGATCCTGTC GGAGGGCGTC ATCGACTCCG ACCTGACCAA GCTGTTCACG ATCCGCATGG AGACCGGCGA GTTCGACCCG GCTTCGAAGG TTCCTTACAC CAGCATCACC AAGGCGCAGA TCCAGAGCCC GGCGCACCAG GCGCTGGCCA CCAGCGTCGC GGACAACTCG CTGGTCCTGC TGAAGAACGC CAATGTCTCC GGTACCAGCG CGCCGCTGCT GCCGGCCAGC GCGAGCAAGC TGGCCAACGT CGTCATCCTC GGCGACATGG CCAACCAGGT GACCCTCGGC GACTACTCCG GCGCGCCGTC GCTGCAGGTG AACGCGGTGC AGGGGTTGAC CACCGCGATC AAGGCGGCGA ACCCGAGCGC GAACATCCTC TTCGACGCCG CCGGGACCTC CAGCACCACG ACCTCGGCGG CGACGCTGAG CAGCGCGACG CAGGCCGCGA TCAAGAAGGC CGACCTGGTC GTGATGTTCG TCGGCACCAA CCAGAACAAC GCCCAGGAGG GCAACGACCG CACCACGCTG AACATGCCGG GCAACTACGA CTCGCTGATC ACCCAGACCA CGGCGCTGGG CAACCCGAAG ACCGCGCTGG TCGTGCAGTC CGACGGCCCG GTGAAGATCA GCGACGTGCA GGGCAGCGTG CCGGCGGTCG TGTTCAGCGG CTACAACGGC GAGAGCCAGG GCACGGCGCT GGCCGACGTG CTGCTCGGCA AGCAGAACCC GAGCGGGCAC CTGAACTTCA CCTGGTACGC CGACGACTCG CAGCTTCCGG CGATGTCGAA CTACGGTCTG ACCCCGGGCG ACACCAGCGG GCTCGGCCGG ACCTACCAGT ACTTCACCGG CACGCCGACC TACCCGTTCG GCTACGGCCT GAGCTACTCG GCCTTCACCT ACTCCGCCGC GACCGTCGAC AACGCCAGCC CGAACGCTGA CGGCACGGTC AACGTCAGCT TCAAGGTCAC CAACTCCGGC AGCACCGCGG GGGCGACCGT GGCGCAGCTG TACGCCGCGA CGCAGTTCAC GGAGTCCGGG GTGCAGCTGC CGACCAAGCG GCTGGTGGGC TTCCAGAAGA CCGGCGTCCT GAACCCCGGC GCCGCGCAGC AGATCACGAT TCCGGTGAAG ATCAGCGACC TGTCGTTCTG GAACGCCACG ACGATGAAGT CCGTGGTCTA CGACGGCACG TACGCCCTGC AGGTCGGCGC CAGCGCCTCG GACATCCGGA CCTCGGTGAA CGTCGCGGTC TCCGGCGCCA TCACGCCGAA GGTGCAGACC GTGACCGTGC AGCCGGAGAG CGTGGTCTAC AACGCCGGCT CGACCATCGA CCTGACCGGC AAGAACCAGT GGATCAAGGA CGACACCACC GGCGTCGGCT CGGTCGCGCA GGGCCGGAAC ATGAGCGTGA CGGCGGACAA CGTCATCGAG GCCGTCAACA ACGACCAGTC GTTCGTGAAC CTGGCGGGCG CCTCGGTCAG CTACAGCAGC AGCGACCCGA CGGTCGCGAC CGTCACCAGC ACCGGGCAGG TGCACGCGGT CGGCGACGGC ACGGCGCTGA TCAGCGTCAC GGTCAACGGC GTCACCGGCA CCGCGCCGAT CGTGGTCCGG CACACGCTGA GCCTGGCCGC GCCGACGCTG ATCACCGCGG GCGGCAGCGG CACGGCGACC ACGACGTTCG TCAACGGCGG CACCGCTGCC GAGAGCAACG TCAATGTCGC ACTGACTCTG CCTTCGGGCT GGAGCGCGCA GGCCACCACG CCTTCGTCGT TCGCCAGCGT CGCCGGCGGG CAGTCGGTGC AGACCACGTG GAAGGTGAAC GCGCCGTCGG GCACGGCGGC GGGACCGTAC GCGGTGTCGG CGCAGGCGAC GTTGACCGGC AGCGGTCCGT ACAGCGACTC CGGCACGATG AACATCGCCT ACCCGTCGCT GAGCGCCGCG TTCAACAACG TCGGCACCAC CGACGACGCC AGCACGGCGG CCGGCAACCT CGACGGCGGC GGGACGAGCT ATTCGGCCCA GGCGCTGTCC GCGGCTGCCG GGATCGCGCC TGGAGCCACG CTCACCCACG ACGGCGCGAC GATCGTCTGG CCGAGTGCGG CGTCCGGGAC CAAGAACGAC ATCGTGGCCT CCGGGCAGAC GGTTCCGGTG TCCGGCTCGG GGACGACGCT GTCTATCGTG GGCACGTCGA CGTACGGGTC CTCCTCCGGC AGCGGGACCA TCATCTACAC CGACGGCAGC ACCCAGAATT ACAGCCTGGG CTTCGCCGAC TGGTGGTCGA CGTCGGCGGC TCAGGGGACT GACTTCCTGG CGAAGCCGAC CTACATCAAC GGCGGCAACG GCAAGATCAC GCAGGCGGTG AACCTCTCGT ACGCGGCGAT CCCGTTGCAG GCGGGCAAGA CGGTGCAGGC CGTCGTGCTG CCGAACGTCA GCGCGAGCGC CGTGAGCGGT TCGGTCTCGA TGCACATCTT CGCCGTGTCG GTGTCCGGCA CCTCGGCGGG TTCGGTCGTC AGCCTGCGCT CGCACGCGAA CAACGACATC GTGACCGCGG ACAACGGCGG CACCTCGCCG CTGATCGCGA ACCGGACCTC GGTCGGGCAG TGGGAGTCGT TCGACCTGAT CACGAACTCC GACGGCAGCG TGAGCCTCCG GTCGCACGCG AACAACGACA TCGTGACCGC CGACAACGCC GGTGCGGCGC CGCTGATCGC CAACCGGACC TCGATCGGGC CGTGGGAGGA GTTCGACCTG ATCCACAACT CCGACGGCAG CGTCAGCTTC CGGTCGCACG CGAACAACGA CATCGTGACC GCCGACAACG CCGGCGCGGC GCCGCTGATC GCTAACCGGA CCGCGGTCGG CCCGTGGGAG GAGTTCGATC TGATCCACGA CTGA
|
Protein sequence | MRRTTPHRRK LIAAGSALAL VCALAASATS VAASAKDTAA PAASSTPIYL NTSYPFEARA ADLVSRMTLA EKAAQLNTTS APAIPRLGVQ QYTYQAEAQH GINYLGGDQN SGSVAGNPPV ATSFPTNFAS SMSWDPALVY QETTAVSDEA RGLVDKSLFG TGQNNLGPSA SDYGSLTFWA PTVNLDRDPR WGRTDEAFGE DPYLVGQMAG AFVNGFQGNS MTGQSLDGYL KAAATAKHYA LNDVEQNRTG ISSNVSDTDL RDYYTKQFAD LIENSHVAGL MTSYNAINGT PSVADTYTAN QLAQRTYGFN GYVTSDCGAV GTAYRNFPAG HAWAPPGWTT DGGDTNSIWT NTSTGAKISG AAGGEAYSLR AGTQVNCGGD EFSLQNIQAA ISAGILSEGV IDSDLTKLFT IRMETGEFDP ASKVPYTSIT KAQIQSPAHQ ALATSVADNS LVLLKNANVS GTSAPLLPAS ASKLANVVIL GDMANQVTLG DYSGAPSLQV NAVQGLTTAI KAANPSANIL FDAAGTSSTT TSAATLSSAT QAAIKKADLV VMFVGTNQNN AQEGNDRTTL NMPGNYDSLI TQTTALGNPK TALVVQSDGP VKISDVQGSV PAVVFSGYNG ESQGTALADV LLGKQNPSGH LNFTWYADDS QLPAMSNYGL TPGDTSGLGR TYQYFTGTPT YPFGYGLSYS AFTYSAATVD NASPNADGTV NVSFKVTNSG STAGATVAQL YAATQFTESG VQLPTKRLVG FQKTGVLNPG AAQQITIPVK ISDLSFWNAT TMKSVVYDGT YALQVGASAS DIRTSVNVAV SGAITPKVQT VTVQPESVVY NAGSTIDLTG KNQWIKDDTT GVGSVAQGRN MSVTADNVIE AVNNDQSFVN LAGASVSYSS SDPTVATVTS TGQVHAVGDG TALISVTVNG VTGTAPIVVR HTLSLAAPTL ITAGGSGTAT TTFVNGGTAA ESNVNVALTL PSGWSAQATT PSSFASVAGG QSVQTTWKVN APSGTAAGPY AVSAQATLTG SGPYSDSGTM NIAYPSLSAA FNNVGTTDDA STAAGNLDGG GTSYSAQALS AAAGIAPGAT LTHDGATIVW PSAASGTKND IVASGQTVPV SGSGTTLSIV GTSTYGSSSG SGTIIYTDGS TQNYSLGFAD WWSTSAAQGT DFLAKPTYIN GGNGKITQAV NLSYAAIPLQ AGKTVQAVVL PNVSASAVSG SVSMHIFAVS VSGTSAGSVV SLRSHANNDI VTADNGGTSP LIANRTSVGQ WESFDLITNS DGSVSLRSHA NNDIVTADNA GAAPLIANRT SIGPWEEFDL IHNSDGSVSF RSHANNDIVT ADNAGAAPLI ANRTAVGPWE EFDLIHD
|
| |