Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_6878 |
Symbol | |
ID | 8338244 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 7947087 |
End bp | 7949888 |
Gene Length | 2802 bp |
Protein Length | 933 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 644959966 |
Product | glycoside hydrolase family 3 domain protein |
Protein accession | YP_003117557 |
Protein GI | 256395993 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0414286 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTGTGC GTCCACTACC GCTGACGAAA TCAAGAGCTT TCGTCGCCCT CACCGCTCTG AGTCTCGGCG CCGCCGGGCT CCTGAGCGCC GAACTCCCCG CCTCCGCCGC CCCCGCGGCG TCGGCCACGT GTCCCTGGGT CGGGTCGAAC GCCCCGGTGG CCAGCCGTGT CAGCCAGCTG ATGGCCAAGA TGAGCCTGTC GCAGGAAATC TCCATGATGA CCGGCACCAA GGGGTCGAGC TTCGTGGGGG AGACCCCGGC GATCGGCTCG CTGTGCATCC CCGCCATGAA CCTGGAGGAC GGGCCGGCCG GCGTCGCCGA CGGCATGACC GGCGTCACCC AGCTGCCCGC CCCGGTGAGC GCGGCCGCCA CCTGGGACAC CGGCGCGGAA TCCGCCTACG GCAAGGTGAT CGGCTCCGAG GAGGCCGCCA AGGGCTCCAC CGTCGATCTC GGGCCGACGA TCAACATCGT GCGGGACCCG CGCTGGGGCC GGGCCTTCGA GTCGATCGGC GAGGACCCGT ACCTCAACGG CGTTCTCGGC GCCGCGGAGA TCCGCGGAGT GCAGTCGACC GGTGAGATGG CGCAGGTGAA GCACCTGGCC GCCTACAACC AGGAGACCCA CCGCAACACC TCCAGCGACA ACGTGATCGT CGACCAGCGC ACGCTGGAGG AGATCTACCT GCCGGCGTTC GACACGTCGG TGGGCTCGGG CGCGGCTTCG TCGGTCATGT GCTCGTACAG CACGATCAAC GGGACCTACG CCTGCCAGAA CCCGAACATC ATGAACGATG TCATCCACAA GCAGTTCGGC TCCAACGCGT TCATCACCTC GGACTGGGGC GCCCTGCACA CGACGGCCGG CGGCGCGAAC GCCGGGCTGG ACCAGGACAT GCCCGGTGAT GACGGCTACT ACGGTGGCGC GCTGCAGACA GCCGTCAACA ACGGCCAGGT CAGCAAGGCG ACGATCGACG CCGCGGTCCG CCGCGTCCTG ACCCAGATGT TCGGCTTCGG CATGTTCGAC AACGTCTCCA GCGGCTCGCC GCGCGCGACG GTGACGTCCT CGGCGCACAC CGCCACCGCC CGGCAGATCG CCGACCAGGG CACCGTGCTG CTGAAGAACG CCAACAACCA GCTGCCGCTG ACCTCCTCGA CGAAGTCCAT CGCCGTGATC GGTGCGGACG CCTCCACCAG CGTTCGCAGC GCCGGTGGCG GTAGCGCGTC GGTCAAGGCC GGCAGTGTGG TCAGCCCGCT GGCCGGGATC ACCTCGCGGG CCGGCAGCGG GGTGACGGTG ACCTACAACG ACGGCTCCTC CTCCAGCTCG GCCGCGACGG CCGCCGGCAA GGCGAACGTG GCAGTGGTCT TCGTCAGCAA GTCCGAGAGC GAGGGCGGCG ACCTGTCGAA CATCGACCTG GCCAGCTCGG ACAACGCGCT GATCTCGGCG GTCGCCAAGG CCAACCCGCA CACCGTGGTG GTGCTGAACA CCGGCTCCGC GGTCACCATG CCGTGGCTGT CCTCGGTCTC CGGCGTCTTC GAGGCCTGGT ACTCCGGCCA GGAGGACGGC AACGCGATCG CCGACCTGCT GTTCGGCGAC GTCGACCCGT CGGGGCACCT GCCGGTGACC TTCCCGACGT CGCTCAGCCA GGTGCCGGCG AACACCGCCG CCCAGTGGCC GGGCGCCAAC AACAAGGTGG AGTACTCCGA GGGCCTGCAG GTCGGCTACC GGCACTACGA CGCGACCAAC CAGACCCCTT TGTTCCCCTT CGGCTTCGGC CTGTCGTACA CGTCCTTCAG CTTCAGCAAC CTCACTGTCG GCTCGCTGAC CAAGGGCGGC TCCGCCACCG TCACCGCGAA GGTGACCAAC ACCGGCAGCC GTGCCGGCTC CGAGGTCGCC CAGCTCTACG TCGTGGACCC GTCGTCCTCC GGTGAGCCGT CGAAGCAGCT CCAGGGCTTC GGCAAGGTCA CCCTGGCCGC CGGCGCCAGC ACCACGGTCA GCTTCCCCGT CACCGAGGAG AACCTCCGGC ACTGGAACAC CGGGTCCAAC GCCTGGACCA CGGATACCGG CGCCTACGGC ATCCGCGTCG GCGACTCGGC GGCGAACATC CCGCTGAGCG GCACCCTGAA CGTCACCTCG GCCCAGCTCG GCCAGCCGGT GACGGTCACC AACCCGGGCC CGCAGGCCAC CATCGCGGGC GCGACGGTCT CCGTGCCGGT CACCGCGAAG GACTCGACCG GCGGCCAGAC GCCGGCGTTC AGCGCCACCG GGCTCCCGGC CGGCCTGAGC ATCTCCGCCT CGGGCACCAT CACCGGCACG CCCACCAGCG CGGGCACCAC GACCGTGGAC GTGACGGCGA AGGACGGCAA CGGCGCCACC GCGACGACCT CGTTCGTGTG GACCGTCTCG CCCTCCACCG CCGGGGTGCC CACCGTCCCG TACGTCGGTC AGGGCGGCAA GTGCCTCGAC CTGGCGGCGG ACGACAACAC CAACGGCGCC AAGGTCGAGA TCTACACCTG CAACAACACC AGCGGCCAGT CCTGGAGCCA CCTCGCCGAC GGCACCCTCC GGTCCGCCGG CAAGTGCATG GACGTCAAGG GCGCCGGAAC AGCCAACGGC ACCCTCGTCC AGATCTACGA CTGCAACGGC ACCGGTGCAC AGGTCTGGAA GAGCGGCTCC AACGGCTCGT TGATAAACCC GGCCTCCGGC AAGTGCCTCG ACGACCCGAA GTCCTCGACC ACTGACGGCA CCCAGCTGCA GATCTGGGAC TGCAACGGCA ATTCGAACCA GTCTTGGGTC CCGAGGGCTT GA
|
Protein sequence | MRVRPLPLTK SRAFVALTAL SLGAAGLLSA ELPASAAPAA SATCPWVGSN APVASRVSQL MAKMSLSQEI SMMTGTKGSS FVGETPAIGS LCIPAMNLED GPAGVADGMT GVTQLPAPVS AAATWDTGAE SAYGKVIGSE EAAKGSTVDL GPTINIVRDP RWGRAFESIG EDPYLNGVLG AAEIRGVQST GEMAQVKHLA AYNQETHRNT SSDNVIVDQR TLEEIYLPAF DTSVGSGAAS SVMCSYSTIN GTYACQNPNI MNDVIHKQFG SNAFITSDWG ALHTTAGGAN AGLDQDMPGD DGYYGGALQT AVNNGQVSKA TIDAAVRRVL TQMFGFGMFD NVSSGSPRAT VTSSAHTATA RQIADQGTVL LKNANNQLPL TSSTKSIAVI GADASTSVRS AGGGSASVKA GSVVSPLAGI TSRAGSGVTV TYNDGSSSSS AATAAGKANV AVVFVSKSES EGGDLSNIDL ASSDNALISA VAKANPHTVV VLNTGSAVTM PWLSSVSGVF EAWYSGQEDG NAIADLLFGD VDPSGHLPVT FPTSLSQVPA NTAAQWPGAN NKVEYSEGLQ VGYRHYDATN QTPLFPFGFG LSYTSFSFSN LTVGSLTKGG SATVTAKVTN TGSRAGSEVA QLYVVDPSSS GEPSKQLQGF GKVTLAAGAS TTVSFPVTEE NLRHWNTGSN AWTTDTGAYG IRVGDSAANI PLSGTLNVTS AQLGQPVTVT NPGPQATIAG ATVSVPVTAK DSTGGQTPAF SATGLPAGLS ISASGTITGT PTSAGTTTVD VTAKDGNGAT ATTSFVWTVS PSTAGVPTVP YVGQGGKCLD LAADDNTNGA KVEIYTCNNT SGQSWSHLAD GTLRSAGKCM DVKGAGTANG TLVQIYDCNG TGAQVWKSGS NGSLINPASG KCLDDPKSST TDGTQLQIWD CNGNSNQSWV PRA
|
| |