Gene Caci_3714 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_3714 
Symbol 
ID8335067 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp4176752 
End bp4178752 
Gene Length2001 bp 
Protein Length666 aa 
Translation table11 
GC content69% 
IMG OID644956854 
ProductGlucan endo-1,6-beta-glucosidase 
Protein accessionYP_003114457 
Protein GI256392893 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG5520] O-Glycosyl hydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.00391353 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGCCTCGAA GACTCTCAGC ACTGACTCTG ATCGGCGGAC TCCTCGTCGC GGCCGGCGCT 
ATCGTCCCGG TCGCGTCCGC CGCCACCGGC CAGGGCGCCG GGCACGACAC CGGGCGCGGG
CCGAAGCCTG TCGCCGCGCA CGTCTGGCTC ACCACCCCCG ACGGCGCGAA CCGGCTCGCC
GACGCCGGCA CGGTCGCCTT CGGGACCGTG CCGGCCACCG TCCCGACCGT CGTGGTCGAT
CCGACTCTGA CCTACCAGCG GATGCAGGGC TTCGGCGCGG CGATCACCGA TTCCTCCGCG
GCGGTGCTCT CGAACCTCTC CCCCGCGACG CGCACCGCCA CCATGCGCTC GCTGTTCGAC
CCGGTGACCG GCGACGGCCT GGACTACCTG CGCCAGCCGA TCGGCGGCTC GGACTTCGTC
GCGAGCGCCG CCTACACCTA CGACGACGTG CCCGCCGGAC AGACGGACTA CGCGCAGCGC
GACTTCTCCA TCGCGCACGA CAAGGCGCAG ATCATCCCGC TGCTGCGGCA GGCCAAGGCG
ATCAACCCCC GCCTGCAGAT CGTCGCCACG CCGTGGAGCC CGCCGGCGTG GATGAAGACC
GGCGGCTCGC TGACCGGCGG CCGGCTCATC GACGATCCGC GCGTCTATCA GGCCTACGCG
CTGTACCTGC TGAAGTTCGT CGAGGCCTAC CAGGCCGCCG GCGTCCCGGT CGACACCATC
ACGGTGCAGA ACGAGCCGCA GAACCGGACT CCGTCGGGCT ATCCGGGCAC TGACATGCCC
TCCTGGCAGG AGGAGAAGGT CATCGAGGAC CTCGGCCCGA TGCTGCGCCA GGCGCACCTG
CACACGCAGA TCCTCGCCTA CGACCACAAC TGGACCGAGC ACCCGAACGA CGTCGCCGCC
ACCCCGCCGG ACGAGACCGC GGACATCGAC GCCTACCCGC AGAACGTGCT GAACTCCCCG
GCCGCCAGGT GGGTCTCCGG CGTCGCGTTC CACTGCTACA GCGGCGATCC CAGCGCGATG
ACCGCGTTCC ACAACCAGTT CCCGGACAAG GCGATCTACT TCACCGAGTG CTCCGGCAAC
CAGTCGAGCG ACCCCGCGAA CACCTTCTCC GACACACTGA AATGGCACGC CAGGAACCTG
ACGATCGGCG CCACCCGCAA CTGGGCCGAG ACAGTAGTCA ACTGGAACCT GGCACTGGAC
CCCAGCGGCG GCCCGCACGT CGGAGGCTGC GGCACCTGCA CCGGCGTCGT CACCGTCAAC
CCGGACGGCA CCGTCACGGA CAATGCGGAG TACTACACCC TCGGACACCT GGCACGCTTC
GTGAAGCCGG GAGCGCTGCG CATCGCCAGC ACATCCTTCG GCACGACCGG CTGGAACGGC
CAGATCATGG ACGTCGCATT CCAGAACCCG GACGGCAGCA CCGCACTCGT CGCCCACAAC
GAGAACGACA ACCCCCAGAC CTTCGCAGTC CAGGAGGGCG ACCAGAACTT CACCTACACC
CTGCCCGGCG GCGCCCTGGC CACCTTCACC TGGAACGCAC ACCTCCCCGG CAGCACCACC
CTGCGCCAAC TCGACCCCAC CGGCTGGCAC GCCTCCGCGA ACCCCCCAGG CCCGACCGAC
CCCTGCTGCT CCGCCGACGT AGCGGCCAAC GCCACAGACG CCGACGCCAG CACCCGCTAC
TCCTCAGGCA CAGCCCAGGC AGCAGGCCAG TACCTGCAGG TCAACTTCGG CAAGGCCGTC
ACCGCACGCC GCGTCGTATT CGACACCGGC GCCTCCACCG GCGACTACCC GCGCGGCTAC
AGCGTGAGCA CCAGCAGGGA CGGCGTCTCA TGGACGACGG CTACCGTGTC GGGGGTTGGC
AGCGGCCAGT TCACGACGGT GGATCTGACC GGCGCGCCGA TCCGGTACGT CCGTCTAACG
CTTACTGCGG CTAACGGAAG CTGGTGGAGC GTCGCCGATG TGCGCGCGTA CACCGGCGGA
GGGTGGTCGG GCGACAAATA G
 
Protein sequence
MPRRLSALTL IGGLLVAAGA IVPVASAATG QGAGHDTGRG PKPVAAHVWL TTPDGANRLA 
DAGTVAFGTV PATVPTVVVD PTLTYQRMQG FGAAITDSSA AVLSNLSPAT RTATMRSLFD
PVTGDGLDYL RQPIGGSDFV ASAAYTYDDV PAGQTDYAQR DFSIAHDKAQ IIPLLRQAKA
INPRLQIVAT PWSPPAWMKT GGSLTGGRLI DDPRVYQAYA LYLLKFVEAY QAAGVPVDTI
TVQNEPQNRT PSGYPGTDMP SWQEEKVIED LGPMLRQAHL HTQILAYDHN WTEHPNDVAA
TPPDETADID AYPQNVLNSP AARWVSGVAF HCYSGDPSAM TAFHNQFPDK AIYFTECSGN
QSSDPANTFS DTLKWHARNL TIGATRNWAE TVVNWNLALD PSGGPHVGGC GTCTGVVTVN
PDGTVTDNAE YYTLGHLARF VKPGALRIAS TSFGTTGWNG QIMDVAFQNP DGSTALVAHN
ENDNPQTFAV QEGDQNFTYT LPGGALATFT WNAHLPGSTT LRQLDPTGWH ASANPPGPTD
PCCSADVAAN ATDADASTRY SSGTAQAAGQ YLQVNFGKAV TARRVVFDTG ASTGDYPRGY
SVSTSRDGVS WTTATVSGVG SGQFTTVDLT GAPIRYVRLT LTAANGSWWS VADVRAYTGG
GWSGDK