Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_4214 |
Symbol | |
ID | 8335568 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 4776901 |
End bp | 4778727 |
Gene Length | 1827 bp |
Protein Length | 608 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 644957317 |
Product | glycoside hydrolase family 5 |
Protein accession | YP_003114919 |
Protein GI | 256393355 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2730] Endoglucanase |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.303504 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGCGG CGTTGATCGC GGCGTCGGTC ATGACCTTCG CCGGGGTGGC CGCGCAGCCG GCCACCGCGC GCACCGATGC CGCCGCCGCC CACACCACCG CCGCCACCGC CGGCACCACC GGCACCGCCC CGGCCGCGAC AGCGGCGGCA GGCGCCGGAT ACTGGCACAC CAGCGGCCGG GAGATCCTGG ACTCGGCCAA CCAGCCGGTC CGCATCGCCG GCGTCAACTG GTTCGGCTTC GAGACCAGCA ACGACGTCGT CCACGGCCTG TGGAGCCGGG ACTACAAGTC GATGATGGAC CAGATGAAAT CGCTGGGCTA CAACACGATC CGGCTGCCCT ACAGCGACGA CATCTTCAAG CCCGGCACGA TGCCGAACAG CATCAACTTC TACCAGATGA ACACCGACCT GCAGGGTCTG ACCTCGCTGC AGGTGATGGA CAAGATCGTG GACTACGCCG GCTCGATCGG CCTGAAGGTG ATTTTGGACC GCCACCGTCC GGACGCCGGC GGGCAGTCGG CGCTGTGGTA CACCTCCACA GTCCCGGAAT CGACGTGGAT CAACGACCTG AAAGCCATCG CCACGCGCTA CCAGGGCAAC CCGGCGGTGG TCGGGATCGA CCTGCACAAC GAGCCGCACG ACCCGGCGTG CTGGGGCTGC GGGGACACCA CGATCGACTG GCGGCTGGCC GCCGAGCGCG GCGGCAACGC GGTGCTGTCC GTCAACCCCA GCCTGTTGAT CTTCGTCGAG GGTGTTCAGA CCTTCAACGG CAGCTCCTAC TGGTGGGGCG GCAACCTACA GGGCGCCGGG CAGTATCCGG TGCAGCTGTC GGTGGCCAAC CGGGTCGTCT ACTCGGCCCA CGACTACGCC ACCAGCGTGG CCAGCCAGCC GTGGTTCACC GACCCGAGCT TCCCCTCCAA CATGGCCGGG ATCTGGGACA AGAACTGGGG CTACCTGTTC AACCAGAACA TCGCCCCCGT GTGGGTCGGC GAGTTCGGCA CGACGCTGTC CGCCACGACC GACCAGGTCT GGCTCAAGAC GCTGGTCTCC TACCTGCGTC CGACCAGCAC CTACGGCGGC GACTCCTACC AGTGGACGTT CTGGTCCTGG AACCCGGACT CCGGTGACAC CGGCGGCATC CTGAAGGACG ACTGGAACTC GGTGGACACC GTCAAGGACG GCTATTTGAC CTCGATCAAG GCACCGGCCT TCGGCGGCGG GGGCGGCGGA GGCGGAGGCG ACAGCTCGCC GCCGACCGCG CCGACCAACG TGGCGGCGAC CGTCACCACC TCCAGCTCGG TAGCGCTGTC CTGGACCGCC TCGACCGACA ACGTCGGCGT CACCGGCTAT GACATCTACC GGGGCAGCAC CCTCGCCGGA ACGTCGGCGG GCACGACGTT CACCGACAGT GGACTGACAC CCTCGACCGC GTACACCTAC ACGGTCAAGG CGTACGACGC AGCCGGCAAC CTGTCGGCGG CCTCGGCCGC GGTGTCCGCG ACGACGCAGG GCGGCGGCAG CAACGCCGGC TGCACCGCGG CGTACACGGT GTCGAACCAG TGGAACAACG GCTTCACCGC CGCGGTCACG ATCACCAACT CCGGAACCGC GGCGACCCAC AGCTGGAAGG TCACCTGGAC GTGGCCGGGC GGCCAGCAGG TCAGCAACGC CTGGAACGCC ACCGAAGCCC ACAGCGGGCA GAGCGAGACG TTCACCGCGG TGAGCTCCGA CGCGGTGGTC GCGCCGGGAG GCACGACGTC GTTCGGATTC CAGGCGAGCT ACAACGGGAC CAACCCGGCA CCGACTCCAG TGTGCACAGC GAGCTGA
|
Protein sequence | MAAALIAASV MTFAGVAAQP ATARTDAAAA HTTAATAGTT GTAPAATAAA GAGYWHTSGR EILDSANQPV RIAGVNWFGF ETSNDVVHGL WSRDYKSMMD QMKSLGYNTI RLPYSDDIFK PGTMPNSINF YQMNTDLQGL TSLQVMDKIV DYAGSIGLKV ILDRHRPDAG GQSALWYTST VPESTWINDL KAIATRYQGN PAVVGIDLHN EPHDPACWGC GDTTIDWRLA AERGGNAVLS VNPSLLIFVE GVQTFNGSSY WWGGNLQGAG QYPVQLSVAN RVVYSAHDYA TSVASQPWFT DPSFPSNMAG IWDKNWGYLF NQNIAPVWVG EFGTTLSATT DQVWLKTLVS YLRPTSTYGG DSYQWTFWSW NPDSGDTGGI LKDDWNSVDT VKDGYLTSIK APAFGGGGGG GGGDSSPPTA PTNVAATVTT SSSVALSWTA STDNVGVTGY DIYRGSTLAG TSAGTTFTDS GLTPSTAYTY TVKAYDAAGN LSAASAAVSA TTQGGGSNAG CTAAYTVSNQ WNNGFTAAVT ITNSGTAATH SWKVTWTWPG GQQVSNAWNA TEAHSGQSET FTAVSSDAVV APGGTTSFGF QASYNGTNPA PTPVCTAS
|
| |