Gene Caci_3712 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_3712 
Symbol 
ID8335065 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp4171779 
End bp4175957 
Gene Length4179 bp 
Protein Length1392 aa 
Translation table11 
GC content67% 
IMG OID644956852 
ProductBeta-galactosidase 
Protein accessionYP_003114455 
Protein GI256392891 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1874] Beta-galactosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.173046 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.0203648 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGACTTC GGGTGCACAG TGTGACGCGG CTTCGGCACG GCGGCCGGGT GGTGGCGGCG 
GCGACCGCTG TCATGGCTCT GATATTCGGG TCCGTCGCTG GGGCGCACGC GAGCCCGGTG
CAGGGCAGAG ATGTCGCGCA GAGCAGGGAT GTCGCGGCCG CAACGGCGCC GGCTTCTGTG
GGGACTGCTC ATACGGTCAC GTATGACGGG TACTCGTTCC TCGTTGACGG CAGTCGCACC
TACCTGTGGT CTGGTGAGTT CCACTACTTC CGGCTGCCGA GTCCGAGTTT GTGGCTGGAC
ATCTTCCAGA AGATGAAGGC GGCTGGGTTC AATGCCACGT CGCTGTACTT CGACTGGGGC
TACCACTCGC CGGCGCCCGG GGTGTACGAC TTCACCGGCG TGCGGGATGT CGATGAGTTG
CTGGACATGG CGCAGCAGGC GGGCCTGTAT GTGATCGCGC GGCCCGCGCC GTACATCAAC
GCCGAGGTGG ACGGTGGCGG GCTGCCGGCT TGGCTCGGTA CGAAGGACGT GAAGAACCGG
ACCGACGACC CGGCTTTCCT GTCCTACGCC GATCAGTGGC TCACCCAGAT CGACGCGATC
CTCGCGCGGC ATCAGCTCAC CAATGGCACC GGTAGCGTGA TCGCATATCA GGTCGAGAAC
GAGTACTACA ACGGTTCGGC GACCGGCCGC GCCTACATGC AGCACCTTGA GGACAAGGCT
CGCGCCGACG GCATCACCGT GCCGCTGACC GGCAACAACA ACGGGACGTT CGGCAGCGGG
ACCGGTGCGC TGGACGTCGA CGGTCCCGAC TCCTACCCGC AGGGCTTCAA CTGCTCGAAT
CCGAGCGCGT GGAACGGTGT TCCCGACATC AGCTACGACC ACCCGGCCGG CAAGCCGCTG
TACACCCCGG AGTTCCAGGG CGGCGCCTTC GACCCCTGGG GCGGCCCGGG CTACGACAAG
TGCGCCCAGC TGATCAACGA TCAGTTCGCT GATGTCTTCT ACAAAAACAA CATCGCCGTC
GGCGCGACCG CGCAGAGCTT CTACATGACC TACGGCGGCA CCAACTGGGG CTGGCTCGGC
GAGCCCGAGA ACTACACCTC CTACGACTAC GGCGCGGCGA TCCGGGAGAC CCGGCAGCTC
GATCCGAAGT ACTCCGAGGA CAAACTGATC GGCGACGCGC TGGCCTCGAT GCCGGACCTG
ACCAAGACCG ACCCGATCCA GACCACCGCG CCGGACGACG CCGCGATCGT CGACACCGCG
CGGCGCAACC CCGACACCGG CGCGCAGTTC CATGTACTGC GGCACTCGGA CTCGACCTCG
ACCGCGGTGG ACAACACCCA CATCGCCGTT GATTTCAACG CGCTTCCGGC CGGGAACTAC
ACCTACGACG ACGTCGATCC GGTCCTGCAG TACACCGGCG CCTGGTCGCA CGTCGCCAAC
CAGAGCTACA CCGGCAGCGA CTTCAAGAAC ACTGAGTCGT TCTCCAACAC CGCCAACGAC
TCACTGACGG TTCCGTTCAC CGGCACCGCG ATCCGGTGGA TCGGCTCGAA GACCAACAAC
CACGGCTATG CCGACGTCTA CCTCGACGGC GTCAAGCAGA CCACCGTCGA CTGCTCCGGC
AGCCAGAGCC AGGCGGTGCT CTACCAGGCG AGCGGCCTGA CCGCCGGACC GCACACCCTC
AAGATCGTCG TGGACGGCAC CCACGCCTCC GGCTCGACCG ACAACTTCGT GTCCGTGGAC
GCCGTCGACC TGCCGCCCGC CGGAAGCGGA GCCGGCCCGA CGTATCCGAG CGTCCCCCAG
GAACCGGGCA CCGCGATCAC CCTCAACGGG CGCGAATCGG ACCTGCTGGT CGCGGACACC
AAGATCGGCG ACTCGCGGTT GCAGTACTCG ACCTCGCAGC TGATGACCTC GCAGACGATC
GGTAGCCGTG ACGTCGCAGT GTTCTACGGC GACAAGGGCA CCGACGGCGA AACGGTCCTG
CGGTACGCGA GCCGGCCGAC CGTCCAGAGC ACCGACGGCG CCGTCAAGGT GACCTGGGAC
GCCGCCAGCG GCGACCTGCG GCTGAACTAC CAGCACTCCG GCCTGACCCG GGTGACCATC
ACCGGCAGCG GCTCGCGTCC GCTGCTGCTC CTGCTCGCCG ACAAGCCGAC CGCCGAGACG
TTCTGGACGC AGAACACTGC CACGGGTCCG GTTCTCGTGC GCGGTACCCA TCTGTTGCGG
ACCGCCGCGA GTGCTGACGG TGGCAGGGTC CTGAATCTGA CCGGCGACAA CGGCACCGAC
CCCGGTATCG AGGTCTTCAC CTCCGCCACG TCGGTGACCT GGAACGGCCA TGCGGTGCAC
GCCAAGGGCT CCGCCACCGG AAGCCTCGTC GGCACGGTAT CCACCGCGGC GGCGATCACT
CTGCCCGCGC TCACCGACTG GAAGTACCAG GCTGAGTCGC CGGAGGCACA GTCAGGCTTC
GACGACTCGA CCTGGACGGT CGCGGACAAG ACGAGCACCA ACAGCGTCAC CGGTGTCGGT
TCGCTACCGG TTCTCTACGC CGACGACTAC GGCTTCCACA CCGGCAGCAC CTGGTACCGC
GGCAGGTTCC GCTCCTCCCC CACGGCCACC GGCATCCACC TGGTCTCTGA TTCGGGCGGA
GGCGCGCAGG CCTTCTCGGT CTGGCTGAAC GGGACATTCC TGGGCAGCTC CACCAACGGC
AGCGGCGACT TCACCTTCCC GGCCGGATCG CTGAAGCAGA GCGGGGACAA CATCGTCTCG
GTGCTCACCG TGAACATGGG TCACGAAGAG GACTACAACT CCACCAACAA CAGCACCTCT
GCGCGTGGGC TCACCAGTGC CTCGCTCGTC GGAGCTCCGC TGACGTCGGT GACCTGGCGG
TTGCAGGGCG TCCGCGGCGG CGAGCAGGAG ATCGACCCGG TGCGCGGTCC GCTGTCGACC
GGCGGTCTGT ACGGCGAGCG CGCCGGCTGG CCGCTGCCCG GCTTCGACGA CTCGGCGTGG
AAGCCGGTGA GCCTTCCGGC CCACGACACG ACCCCGGGCG TCGCCTGGTA CCGCACGACC
GCGAACCTGA ACCTGCCGAA GGGTCAGGAC ACCTCGCTCG GTCTCACCAT CACCGACGAT
CCGTCGAAGA AGTATCGCGC GGAGCTGTAC GTCAACGGCT GGATGGTCGG CAACTACGTC
AACTACCTCG GCCCGCAGCA CAGCTTCCCG ATCCCCAACG GGATCCTGAA GACCGACGGG
AGCAACACGA TCGCGATCGC GGTGTGGAAC CTGGACGGCA GCACCGGCGG CCTCGGCACG
GTCTCGCTCA CCGACTACGG CAGCTACGCG TCCTCGCTCA AGGTGGATAC GGTCGACAGT
CCCCGGTACA ACAAGGCCAC GTACGCGATG CCCGCGGCGC CGGGCGTGAA CGTGAACCTT
CAGGTCCCTG ACACCGCGCA AGCCGGGACC GCCTTCACCG CCACCGCGAC CGTGTCGGTC
CCGGCCGGTC GGGGACGCGC GAGCGGACTC ACGCCCTCGC TGAGCCTTCC GCCCGGCTGG
ACCGCCAGCG CCCCGAGCCC GGCAACTATC AGCTCTGTGA AGGACGGACA GTCGGCGACG
TTCACCTGGA GCGTGCAGCC ATCAGCCGGC GCCCAACCTT CAGCCGCCGC GCTCACCGCG
ACGATCGGCT ACACACAGCA CGACAAACCC GGCACGGCGA AGGACGAGCG CGTCGTCGGC
TACTACGTGC CGCCCGCAGC GGGTCAGGAC AACATCAGCG ACCTGGCATT CACCGCCGCG
ACCAACGGCT GGGGACCGGT CGAACGCGAC ATGAGCAACG GCGAGCAGGC CGCCGGCGAC
GGACACACCA TCACCATCAA CGGTGCGACC TCCGCCAAGG GCCTCGGCAC GAACGCGACC
AGCGACGTAC GGATCTACCT CGGCGGCCAC TGCACCACCT TCACCGCCTC GGTGGGCGTG
GACGACGAGA CCAACGGCGC CGGCACCGTC ACCTTCAGCG TCCTCGCCGA CGGCAGAACA
CTGACCACCA CCCCCGTCAT CGGCGGCCAC CAGGCAGCCA CGCAGCTGTC AGCCGACCTC
ACCGGCGCCC AGATGCTCGA CCTGGTGGTC GGCGACGGCG GCGACGGCAA CGCGCACGAC
CACGGGGACT GGGGAGGCGC GCAGATCACT TGCTCCTGA
 
Protein sequence
MRLRVHSVTR LRHGGRVVAA ATAVMALIFG SVAGAHASPV QGRDVAQSRD VAAATAPASV 
GTAHTVTYDG YSFLVDGSRT YLWSGEFHYF RLPSPSLWLD IFQKMKAAGF NATSLYFDWG
YHSPAPGVYD FTGVRDVDEL LDMAQQAGLY VIARPAPYIN AEVDGGGLPA WLGTKDVKNR
TDDPAFLSYA DQWLTQIDAI LARHQLTNGT GSVIAYQVEN EYYNGSATGR AYMQHLEDKA
RADGITVPLT GNNNGTFGSG TGALDVDGPD SYPQGFNCSN PSAWNGVPDI SYDHPAGKPL
YTPEFQGGAF DPWGGPGYDK CAQLINDQFA DVFYKNNIAV GATAQSFYMT YGGTNWGWLG
EPENYTSYDY GAAIRETRQL DPKYSEDKLI GDALASMPDL TKTDPIQTTA PDDAAIVDTA
RRNPDTGAQF HVLRHSDSTS TAVDNTHIAV DFNALPAGNY TYDDVDPVLQ YTGAWSHVAN
QSYTGSDFKN TESFSNTAND SLTVPFTGTA IRWIGSKTNN HGYADVYLDG VKQTTVDCSG
SQSQAVLYQA SGLTAGPHTL KIVVDGTHAS GSTDNFVSVD AVDLPPAGSG AGPTYPSVPQ
EPGTAITLNG RESDLLVADT KIGDSRLQYS TSQLMTSQTI GSRDVAVFYG DKGTDGETVL
RYASRPTVQS TDGAVKVTWD AASGDLRLNY QHSGLTRVTI TGSGSRPLLL LLADKPTAET
FWTQNTATGP VLVRGTHLLR TAASADGGRV LNLTGDNGTD PGIEVFTSAT SVTWNGHAVH
AKGSATGSLV GTVSTAAAIT LPALTDWKYQ AESPEAQSGF DDSTWTVADK TSTNSVTGVG
SLPVLYADDY GFHTGSTWYR GRFRSSPTAT GIHLVSDSGG GAQAFSVWLN GTFLGSSTNG
SGDFTFPAGS LKQSGDNIVS VLTVNMGHEE DYNSTNNSTS ARGLTSASLV GAPLTSVTWR
LQGVRGGEQE IDPVRGPLST GGLYGERAGW PLPGFDDSAW KPVSLPAHDT TPGVAWYRTT
ANLNLPKGQD TSLGLTITDD PSKKYRAELY VNGWMVGNYV NYLGPQHSFP IPNGILKTDG
SNTIAIAVWN LDGSTGGLGT VSLTDYGSYA SSLKVDTVDS PRYNKATYAM PAAPGVNVNL
QVPDTAQAGT AFTATATVSV PAGRGRASGL TPSLSLPPGW TASAPSPATI SSVKDGQSAT
FTWSVQPSAG AQPSAAALTA TIGYTQHDKP GTAKDERVVG YYVPPAAGQD NISDLAFTAA
TNGWGPVERD MSNGEQAAGD GHTITINGAT SAKGLGTNAT SDVRIYLGGH CTTFTASVGV
DDETNGAGTV TFSVLADGRT LTTTPVIGGH QAATQLSADL TGAQMLDLVV GDGGDGNAHD
HGDWGGAQIT CS