Gene Caci_7497 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_7497 
Symbol 
ID8338867 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp8690582 
End bp8694313 
Gene Length3732 bp 
Protein Length1243 aa 
Translation table11 
GC content69% 
IMG OID644960576 
Productglycoside hydrolase family 2 sugar binding 
Protein accessionYP_003118163 
Protein GI256396599 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3250] Beta-galactosidase/beta-glucuronidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.122838 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGATCCA TTGACACGAT CCGCACACGC CGCTCTGCGA CCCGCGCCTT CGGGGCCGCC 
GCTCTCGTCT GTACGGTCCT CGCCGGAACC GTCCAGATGG CGACGGCCGG CAGCAGCTCG
CAGAAAGCCC TGAGCACCCG CCTCGCCGCA CCGGTATCCG CCGCCGCGAC CGTCACTGAT
CCCTCGACCG GCTCGTCGGC GGTCCAGAAC CTCGGCGCGA CCACCGGCTG GAAGGTCCTC
ACCAGCGCCA CCGCCACCCA GACCGGCGCC CAGATCTCCG CCCCCGGCTT CTCCACCTCC
GGCTGGCTCA GCGTCGCCAA CGACGGCGGC GGAGCACCGG GCACGGAGAT CAATGCTTTG
CTGCAGAACG GTTCCTGCCC GAACGTCTAC TACTCCACCG ACATGAAGAC CTGCTTCGGG
CAGATGACCA AGGTCGGCGC GGAGACCATC GCGCAGTTCT CGGTCCCGTG GTGGTACCGG
ACCGACTTCA CCGCGCCGGC CGGCGGCCAG GGCGCCCGGC TGATCCTGAA CGGGGTCGTC
GGGACGGCCG ACGTGTGGGT CAACGGCACG GAGGTCGCCA CCTCCTCGAC CGTCACGGGG
GACTACGACA AGAGTGTCTT CGACATCTCC TCCAAGCTCC TGAGCGGCAC CAACTCGCTG
GCCATCGAGA TGCACCCGAA CAACCCGGGC TCGATGCTGA CGCTGGACAA CGTCGACTGG
AGCCAGATCC CGCCGGACAA CAACACCGGC ATCCAGTTCC CCGTGCAGCT CGAGGCCGGC
GGTCCGCTGA TCGTCGACGA CGCGCACGTG GACCAGAGCA CCGCGGCGAA CCTGTCGAGC
AGCGCGCTGA CGGTGAAGGC CTCGGTGGTC AACGTCTCCG CCGCCTCGCA GACCGGCGCG
GTCACCGCGA CCCTCACCCC GCCCGGCGGC GGCACGCCGG TCTCGGTGAC CCAGAACGTC
ACCGTCGCCG CGCACGCCAC CTCGGCCGTC ACCTTCGCGC CGGCGAGCTA CCCGGCGCTG
ACGCTGTCCT CGCCGAAGAT CTGGTGGCCC TACCAGATGG GCGCCCAGCC GCTCTACACC
CTGAGCACCT CGGTGGCGCA GAACTCGACC GTCCTGAATT CGACCTCTGA GACGTTCGGC
ATCCGCACCG TCACCTCCAG CCTCGTCGGA GCCGGCGGCG CCTCGCCCGA CGGAGTCCGC
CAGTTCGGCA TCAACGGTCA GCCGCTGGTG ATCCGCGGCG GCGGCTGGGA CCCGGACCTG
TTCCTGCGCT ACGACCCCGC GGACACCGCG CAGCAGATCG CGCTGATGAA GTCCATGGGC
CTGAACGCCA TCCGGCTCGA GGGCCACTTC ATGCCGCCGG ACTTCTACCA GCAGATGGAC
GCCGCCGGAA TCCTGATCAA CGTCGGCTAC CAGTGCTGCG ACGAGTGGGA GAAGAGCGGC
TCGGCCGGCA CGGTGTACCA GAACACCGCG GCGACCCAGG GCGCGATCTG GCGCAACCAC
CCGAGCATCT TCAGCTTCCA GTGGAGCGAC AACGCGCCGA CCTCGACGCA AGAGACCCAG
GCGCTGAACG GCTTCGCCTC GGCCGACTAC CCCGGTCCGT TCATCTCCTC CGCCGAGTAC
AACTCGAGCC CGCAGCTCGG GACGTCCGGG GAGAAGGAGG GTCCTTATGA CTGGGTCCCG
TCGAACTATT GGTACGACAC CACGCACTCC CCGTCCGGCG ACTCCACGCT GACCAACGCC
GGCGGCGCCT GGGGCTTCGA CTCCGAGCAG AGCGCGGGCG ACACCGTCCC GACGATGGAC
TCGCTCAACC GCTTCCTGTC GCCCGCCGAC CAGTCCGCGC TGTGGCAGAC CACCGCCGCC
AACCAGTACC ACGCGAACTA CGAGGGCACC GGCCACAGCG GCTACTCCTT CGGCACGCTC
TACAACCTGG ACCAGGCCGT CTCCAAGCGC TACGGAGCCT GGTCGAGCCT GGCGCAGTAC
GTGGAAGAGG CGCAGGCGCA GAACTACGAG GACACCCGCG CGCAGTTCGA AGCCTTCATC
GCCCACTCCA CCAACGCCAC GCAGCCCTCC ACCGGCACCA TCTACTGGCA GATGAACAAG
GGCTGGCCGA CGCTGCTGTG GTCGCTCTAC AACAACGACT ACGACCAGGC CGGCGCCTAC
TTCGGCGCGC AGGAGGCGAA CCGGTCGCTG CACGCGATCT ACACCCTCGA CAACCACACC
GTGACCGTCG ACAACCTGTC GGGCCAGACA CAGTCCGGCG TGACCGTCGA ATCGAAGGTC
TACAGCACCG CCGGCTCCGT GCTGGACGAC CAGACCTCGA GCTCGCTGTC CCTGGCCTCG
CAGAAGGTCC AGAACAAGGT GCTCACGCCG AAGCTGCCCA CCGCCGCCGG CACGGTCTAC
TTCGTCGAGC TGCTGGTCAA GCAGAACGGC ACCGTGGTCG ACCGCAACGT CTACTGGGAC
TCCACCACGC CGGACGCCGT CAACTGGGGC TCGACCATCC CCTCCGGCGG CGGCAACCCG
CAGGCCACGA TGACCTCCTA CGCGAACCTC ACCGGCCTGC AGAACCTGCC CGCCGCCACG
GTGTCCGCGA CCGCCGCGAC CAGCCGCCAG GCCGGTCCGA ACGGTGCTGA CAGCCTGGTC
ACGGTCACCG TCACCAACAA GTCCACGACC CCGGCCGTCG GCTTCCTGCT CCGCGCGGAC
CTGCGGCGCG GGACCGCTTC GGGCGGCGAG CAGTCCGGCG ACAGCGAAGT CACCTCCGCG
GTCTGGAGCG ACAACGACGT CACGCTGTGG CCCGGCGAAT CCGAGACGCT GACCGCGACC
TACAAGTCCG CCGATCTGCA GGGCGCGACG CCGGTCGTGA GCGTGTCGGG CTGGAACGCG
TCGAAGATCG ATGTCGTCGC CGGCACCGGA ACCGGTACCC CGAACGACTT CTCGATCTCG
GACTCCCCGG CCTCCGGAAA CGTGACTCAG GGTTCTTCGA CCACCGCCAC CGTGTCCACC
TCGGTGGCCG GCGGGAACGC GGAGTCCGTC GCGCTGACCG CCTCCGGTCT GCCGACCGGT
GCCACTGCGA CGTTCAGCCC GGCCGCGGTG ACGGCGGGCA AGTCCTCGAC GCTGACCATC
GCCGCCGCGG CGAGCACACC AGCGGGGACG TACCCGATCA CCATCACCGG TACCGCGCCG
TCGGCGACAC ACACCGCGAG CTACTCGCTG ACCGTCACAT CCTCGGGAGG CGGAGGCACC
TGCACCCCCG CGCAGCTGCT CGCCAACCCG GGCTTCGAAT CCGGCGCCAC CTCCTGGACC
CAGACATCCA CCTTGGGGTT CACCCCGATC ACCAAGGCCA CCTCGGCTGA GCCGGCGCAC
GCCGGTTCGT GGATCGCCTG GTTCAACGGC AACGGCAGCA AGGACACCGA CACCGCCGCG
CAGAGCGTGA CGATCCCGTC CGGGTGTACC GCGTCGCTGT CCTACTGGCT GCACATCGAC
ACCAGCGAGA GCACGACCAC CGCGAAACCG GACACGTTCA CCGTGCAGAT CCTGAACTCC
TCCGGGACCG TGCTCGCGAC CGTCGGCTCG TTCTCGAATC TCGACAAGGC CTCCGGCTAC
ACGCAGCACA CCGCCGACCT CTCGGCCTAC GCCGGGCAGA CGGTCACGGT GAAGTTCACC
GGCACGGAAG CCGACACCAG CGGCGGCACG ACCACCTTCG TGGCCGACGA CACCGCCCTG
CAGACCCACT GA
 
Protein sequence
MRSIDTIRTR RSATRAFGAA ALVCTVLAGT VQMATAGSSS QKALSTRLAA PVSAAATVTD 
PSTGSSAVQN LGATTGWKVL TSATATQTGA QISAPGFSTS GWLSVANDGG GAPGTEINAL
LQNGSCPNVY YSTDMKTCFG QMTKVGAETI AQFSVPWWYR TDFTAPAGGQ GARLILNGVV
GTADVWVNGT EVATSSTVTG DYDKSVFDIS SKLLSGTNSL AIEMHPNNPG SMLTLDNVDW
SQIPPDNNTG IQFPVQLEAG GPLIVDDAHV DQSTAANLSS SALTVKASVV NVSAASQTGA
VTATLTPPGG GTPVSVTQNV TVAAHATSAV TFAPASYPAL TLSSPKIWWP YQMGAQPLYT
LSTSVAQNST VLNSTSETFG IRTVTSSLVG AGGASPDGVR QFGINGQPLV IRGGGWDPDL
FLRYDPADTA QQIALMKSMG LNAIRLEGHF MPPDFYQQMD AAGILINVGY QCCDEWEKSG
SAGTVYQNTA ATQGAIWRNH PSIFSFQWSD NAPTSTQETQ ALNGFASADY PGPFISSAEY
NSSPQLGTSG EKEGPYDWVP SNYWYDTTHS PSGDSTLTNA GGAWGFDSEQ SAGDTVPTMD
SLNRFLSPAD QSALWQTTAA NQYHANYEGT GHSGYSFGTL YNLDQAVSKR YGAWSSLAQY
VEEAQAQNYE DTRAQFEAFI AHSTNATQPS TGTIYWQMNK GWPTLLWSLY NNDYDQAGAY
FGAQEANRSL HAIYTLDNHT VTVDNLSGQT QSGVTVESKV YSTAGSVLDD QTSSSLSLAS
QKVQNKVLTP KLPTAAGTVY FVELLVKQNG TVVDRNVYWD STTPDAVNWG STIPSGGGNP
QATMTSYANL TGLQNLPAAT VSATAATSRQ AGPNGADSLV TVTVTNKSTT PAVGFLLRAD
LRRGTASGGE QSGDSEVTSA VWSDNDVTLW PGESETLTAT YKSADLQGAT PVVSVSGWNA
SKIDVVAGTG TGTPNDFSIS DSPASGNVTQ GSSTTATVST SVAGGNAESV ALTASGLPTG
ATATFSPAAV TAGKSSTLTI AAAASTPAGT YPITITGTAP SATHTASYSL TVTSSGGGGT
CTPAQLLANP GFESGATSWT QTSTLGFTPI TKATSAEPAH AGSWIAWFNG NGSKDTDTAA
QSVTIPSGCT ASLSYWLHID TSESTTTAKP DTFTVQILNS SGTVLATVGS FSNLDKASGY
TQHTADLSAY AGQTVTVKFT GTEADTSGGT TTFVADDTAL QTH