Gene Information |
Locus tag | Caci_3712 |
Symbol | |
ID | 8335065 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 4171779 |
End bp | 4175957 |
Gene Length | 4179 bp |
Protein Length | 1392 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 644956852 |
Product | Beta-galactosidase |
Protein accession | YP_003114455 |
Protein GI | 256392891 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1874] Beta-galactosidase |
TIGRFAM ID | |
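The length fields above follow from the coordinates by simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are illustrative and are not part of the record.

```python
# Minimal consistency check for the coordinate and length fields.
# Assumptions (not stated in the record): coordinates are 1-based and
# inclusive, and the CDS includes one terminal stop codon.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature spanning start_bp..end_bp inclusive."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids encoded by an unspliced CDS with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


START_BP = 4171779  # "Start bp" field
END_BP = 4175957    # "End bp" field

length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

Under these assumptions the computed values agree with the Gene Length (4179 bp) and Protein Length (1392 aa) fields.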
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.173046 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.0203648 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAGACTTC GGGTGCACAG TGTGACGCGG CTTCGGCACG GCGGCCGGGT GGTGGCGGCG GCGACCGCTG TCATGGCTCT GATATTCGGG TCCGTCGCTG GGGCGCACGC GAGCCCGGTG CAGGGCAGAG ATGTCGCGCA GAGCAGGGAT GTCGCGGCCG CAACGGCGCC GGCTTCTGTG GGGACTGCTC ATACGGTCAC GTATGACGGG TACTCGTTCC TCGTTGACGG CAGTCGCACC TACCTGTGGT CTGGTGAGTT CCACTACTTC CGGCTGCCGA GTCCGAGTTT GTGGCTGGAC ATCTTCCAGA AGATGAAGGC GGCTGGGTTC AATGCCACGT CGCTGTACTT CGACTGGGGC TACCACTCGC CGGCGCCCGG GGTGTACGAC TTCACCGGCG TGCGGGATGT CGATGAGTTG CTGGACATGG CGCAGCAGGC GGGCCTGTAT GTGATCGCGC GGCCCGCGCC GTACATCAAC GCCGAGGTGG ACGGTGGCGG GCTGCCGGCT TGGCTCGGTA CGAAGGACGT GAAGAACCGG ACCGACGACC CGGCTTTCCT GTCCTACGCC GATCAGTGGC TCACCCAGAT CGACGCGATC CTCGCGCGGC ATCAGCTCAC CAATGGCACC GGTAGCGTGA TCGCATATCA GGTCGAGAAC GAGTACTACA ACGGTTCGGC GACCGGCCGC GCCTACATGC AGCACCTTGA GGACAAGGCT CGCGCCGACG GCATCACCGT GCCGCTGACC GGCAACAACA ACGGGACGTT CGGCAGCGGG ACCGGTGCGC TGGACGTCGA CGGTCCCGAC TCCTACCCGC AGGGCTTCAA CTGCTCGAAT CCGAGCGCGT GGAACGGTGT TCCCGACATC AGCTACGACC ACCCGGCCGG CAAGCCGCTG TACACCCCGG AGTTCCAGGG CGGCGCCTTC GACCCCTGGG GCGGCCCGGG CTACGACAAG TGCGCCCAGC TGATCAACGA TCAGTTCGCT GATGTCTTCT ACAAAAACAA CATCGCCGTC GGCGCGACCG CGCAGAGCTT CTACATGACC TACGGCGGCA CCAACTGGGG CTGGCTCGGC GAGCCCGAGA ACTACACCTC CTACGACTAC GGCGCGGCGA TCCGGGAGAC CCGGCAGCTC GATCCGAAGT ACTCCGAGGA CAAACTGATC GGCGACGCGC TGGCCTCGAT GCCGGACCTG ACCAAGACCG ACCCGATCCA GACCACCGCG CCGGACGACG CCGCGATCGT CGACACCGCG CGGCGCAACC CCGACACCGG CGCGCAGTTC CATGTACTGC GGCACTCGGA CTCGACCTCG ACCGCGGTGG ACAACACCCA CATCGCCGTT GATTTCAACG CGCTTCCGGC CGGGAACTAC ACCTACGACG ACGTCGATCC GGTCCTGCAG TACACCGGCG CCTGGTCGCA CGTCGCCAAC CAGAGCTACA CCGGCAGCGA CTTCAAGAAC ACTGAGTCGT TCTCCAACAC CGCCAACGAC TCACTGACGG TTCCGTTCAC CGGCACCGCG ATCCGGTGGA TCGGCTCGAA GACCAACAAC CACGGCTATG CCGACGTCTA CCTCGACGGC GTCAAGCAGA CCACCGTCGA CTGCTCCGGC AGCCAGAGCC AGGCGGTGCT CTACCAGGCG AGCGGCCTGA CCGCCGGACC GCACACCCTC AAGATCGTCG TGGACGGCAC CCACGCCTCC GGCTCGACCG ACAACTTCGT GTCCGTGGAC GCCGTCGACC TGCCGCCCGC CGGAAGCGGA GCCGGCCCGA CGTATCCGAG CGTCCCCCAG 
GAACCGGGCA CCGCGATCAC CCTCAACGGG CGCGAATCGG ACCTGCTGGT CGCGGACACC AAGATCGGCG ACTCGCGGTT GCAGTACTCG ACCTCGCAGC TGATGACCTC GCAGACGATC GGTAGCCGTG ACGTCGCAGT GTTCTACGGC GACAAGGGCA CCGACGGCGA AACGGTCCTG CGGTACGCGA GCCGGCCGAC CGTCCAGAGC ACCGACGGCG CCGTCAAGGT GACCTGGGAC GCCGCCAGCG GCGACCTGCG GCTGAACTAC CAGCACTCCG GCCTGACCCG GGTGACCATC ACCGGCAGCG GCTCGCGTCC GCTGCTGCTC CTGCTCGCCG ACAAGCCGAC CGCCGAGACG TTCTGGACGC AGAACACTGC CACGGGTCCG GTTCTCGTGC GCGGTACCCA TCTGTTGCGG ACCGCCGCGA GTGCTGACGG TGGCAGGGTC CTGAATCTGA CCGGCGACAA CGGCACCGAC CCCGGTATCG AGGTCTTCAC CTCCGCCACG TCGGTGACCT GGAACGGCCA TGCGGTGCAC GCCAAGGGCT CCGCCACCGG AAGCCTCGTC GGCACGGTAT CCACCGCGGC GGCGATCACT CTGCCCGCGC TCACCGACTG GAAGTACCAG GCTGAGTCGC CGGAGGCACA GTCAGGCTTC GACGACTCGA CCTGGACGGT CGCGGACAAG ACGAGCACCA ACAGCGTCAC CGGTGTCGGT TCGCTACCGG TTCTCTACGC CGACGACTAC GGCTTCCACA CCGGCAGCAC CTGGTACCGC GGCAGGTTCC GCTCCTCCCC CACGGCCACC GGCATCCACC TGGTCTCTGA TTCGGGCGGA GGCGCGCAGG CCTTCTCGGT CTGGCTGAAC GGGACATTCC TGGGCAGCTC CACCAACGGC AGCGGCGACT TCACCTTCCC GGCCGGATCG CTGAAGCAGA GCGGGGACAA CATCGTCTCG GTGCTCACCG TGAACATGGG TCACGAAGAG GACTACAACT CCACCAACAA CAGCACCTCT GCGCGTGGGC TCACCAGTGC CTCGCTCGTC GGAGCTCCGC TGACGTCGGT GACCTGGCGG TTGCAGGGCG TCCGCGGCGG CGAGCAGGAG ATCGACCCGG TGCGCGGTCC GCTGTCGACC GGCGGTCTGT ACGGCGAGCG CGCCGGCTGG CCGCTGCCCG GCTTCGACGA CTCGGCGTGG AAGCCGGTGA GCCTTCCGGC CCACGACACG ACCCCGGGCG TCGCCTGGTA CCGCACGACC GCGAACCTGA ACCTGCCGAA GGGTCAGGAC ACCTCGCTCG GTCTCACCAT CACCGACGAT CCGTCGAAGA AGTATCGCGC GGAGCTGTAC GTCAACGGCT GGATGGTCGG CAACTACGTC AACTACCTCG GCCCGCAGCA CAGCTTCCCG ATCCCCAACG GGATCCTGAA GACCGACGGG AGCAACACGA TCGCGATCGC GGTGTGGAAC CTGGACGGCA GCACCGGCGG CCTCGGCACG GTCTCGCTCA CCGACTACGG CAGCTACGCG TCCTCGCTCA AGGTGGATAC GGTCGACAGT CCCCGGTACA ACAAGGCCAC GTACGCGATG CCCGCGGCGC CGGGCGTGAA CGTGAACCTT CAGGTCCCTG ACACCGCGCA AGCCGGGACC GCCTTCACCG CCACCGCGAC CGTGTCGGTC CCGGCCGGTC GGGGACGCGC GAGCGGACTC ACGCCCTCGC TGAGCCTTCC GCCCGGCTGG ACCGCCAGCG CCCCGAGCCC GGCAACTATC AGCTCTGTGA AGGACGGACA GTCGGCGACG TTCACCTGGA 
GCGTGCAGCC ATCAGCCGGC GCCCAACCTT CAGCCGCCGC GCTCACCGCG ACGATCGGCT ACACACAGCA CGACAAACCC GGCACGGCGA AGGACGAGCG CGTCGTCGGC TACTACGTGC CGCCCGCAGC GGGTCAGGAC AACATCAGCG ACCTGGCATT CACCGCCGCG ACCAACGGCT GGGGACCGGT CGAACGCGAC ATGAGCAACG GCGAGCAGGC CGCCGGCGAC GGACACACCA TCACCATCAA CGGTGCGACC TCCGCCAAGG GCCTCGGCAC GAACGCGACC AGCGACGTAC GGATCTACCT CGGCGGCCAC TGCACCACCT TCACCGCCTC GGTGGGCGTG GACGACGAGA CCAACGGCGC CGGCACCGTC ACCTTCAGCG TCCTCGCCGA CGGCAGAACA CTGACCACCA CCCCCGTCAT CGGCGGCCAC CAGGCAGCCA CGCAGCTGTC AGCCGACCTC ACCGGCGCCC AGATGCTCGA CCTGGTGGTC GGCGACGGCG GCGACGGCAA CGCGCACGAC CACGGGGACT GGGGAGGCGC GCAGATCACT TGCTCCTGA
Protein sequence | MRLRVHSVTR LRHGGRVVAA ATAVMALIFG SVAGAHASPV QGRDVAQSRD VAAATAPASV GTAHTVTYDG YSFLVDGSRT YLWSGEFHYF RLPSPSLWLD IFQKMKAAGF NATSLYFDWG YHSPAPGVYD FTGVRDVDEL LDMAQQAGLY VIARPAPYIN AEVDGGGLPA WLGTKDVKNR TDDPAFLSYA DQWLTQIDAI LARHQLTNGT GSVIAYQVEN EYYNGSATGR AYMQHLEDKA RADGITVPLT GNNNGTFGSG TGALDVDGPD SYPQGFNCSN PSAWNGVPDI SYDHPAGKPL YTPEFQGGAF DPWGGPGYDK CAQLINDQFA DVFYKNNIAV GATAQSFYMT YGGTNWGWLG EPENYTSYDY GAAIRETRQL DPKYSEDKLI GDALASMPDL TKTDPIQTTA PDDAAIVDTA RRNPDTGAQF HVLRHSDSTS TAVDNTHIAV DFNALPAGNY TYDDVDPVLQ YTGAWSHVAN QSYTGSDFKN TESFSNTAND SLTVPFTGTA IRWIGSKTNN HGYADVYLDG VKQTTVDCSG SQSQAVLYQA SGLTAGPHTL KIVVDGTHAS GSTDNFVSVD AVDLPPAGSG AGPTYPSVPQ EPGTAITLNG RESDLLVADT KIGDSRLQYS TSQLMTSQTI GSRDVAVFYG DKGTDGETVL RYASRPTVQS TDGAVKVTWD AASGDLRLNY QHSGLTRVTI TGSGSRPLLL LLADKPTAET FWTQNTATGP VLVRGTHLLR TAASADGGRV LNLTGDNGTD PGIEVFTSAT SVTWNGHAVH AKGSATGSLV GTVSTAAAIT LPALTDWKYQ AESPEAQSGF DDSTWTVADK TSTNSVTGVG SLPVLYADDY GFHTGSTWYR GRFRSSPTAT GIHLVSDSGG GAQAFSVWLN GTFLGSSTNG SGDFTFPAGS LKQSGDNIVS VLTVNMGHEE DYNSTNNSTS ARGLTSASLV GAPLTSVTWR LQGVRGGEQE IDPVRGPLST GGLYGERAGW PLPGFDDSAW KPVSLPAHDT TPGVAWYRTT ANLNLPKGQD TSLGLTITDD PSKKYRAELY VNGWMVGNYV NYLGPQHSFP IPNGILKTDG SNTIAIAVWN LDGSTGGLGT VSLTDYGSYA SSLKVDTVDS PRYNKATYAM PAAPGVNVNL QVPDTAQAGT AFTATATVSV PAGRGRASGL TPSLSLPPGW TASAPSPATI SSVKDGQSAT FTWSVQPSAG AQPSAAALTA TIGYTQHDKP GTAKDERVVG YYVPPAAGQD NISDLAFTAA TNGWGPVERD MSNGEQAAGD GHTITINGAT SAKGLGTNAT SDVRIYLGGH CTTFTASVGV DDETNGAGTV TFSVLADGRT LTTTPVIGGH QAATQLSADL TGAQMLDLVV GDGGDGNAHD HGDWGGAQIT CS
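The relationship between the two sequence fields can be checked by translating the gene sequence with translation table 11. The sketch below is an illustration, not the database's own procedure. It assumes the gene sequence is given 5' to 3' on the coding strand (it begins with ATG and ends with TGA), so no reverse complement is applied even though the gene lies on the minus strand. The helper names `clean` and `translate` are hypothetical. Only the first three and last one of the 10-base blocks are used here; the full sequence can be pasted in the same spaced format.

```python
from itertools import product

# Amino acid assignments for NCBI translation table 11 (bacterial),
# listed in TCAG codon order. They are identical to the standard code;
# table 11 differs only in which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases into blocks."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and the final block of the gene sequence above.
first_blocks = "ATGAGACTTC GGGTGCACAG TGTGACGCGG"
last_block = "TGCTCCTGA"

print(translate(first_blocks))
print(translate(last_block))
```

The two fragments translate to the first ten residues (MRLRVHSVTR) and the last two residues (CS) of the protein sequence.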