Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_5006 |
Symbol | |
ID | 8336360 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 5730922 |
End bp | 5732472 |
Gene Length | 1551 bp |
Protein Length | 516 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 644958105 |
Product | glycoside hydrolase family 5 |
Protein accession | YP_003115707 |
Protein GI | 256394143 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2730] Endoglucanase |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.180234 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACACAGC CCACGCACCA GCATCCAGAA CGTCGCCGGC GCCATCCATG GCGTGCGGTA GCCGCTTGGA CAGCGGCCGC AGCGCTGATC ACCGCCACGC TGTCGGCGAT CGGCGGCACC GCCTCGGCGG CGACCGGCGC GACCGGCACG ATCACGGGCC AGCAGAGCGG ACGCTGCGTC GACGCGCAAA GCGCCGGCAC AGCGAACGGC ACCGTGGTCC AGCTCTACGA CTGCAACGGC ACCGGCGCGC AGCAGTGGCA GGTCCGCTCC GACGGCTCCG TGCTGAACCC CAACTCGGGC CGATGCCTCG ACGTGACCGG CGCCGGCACG GCGAACGGAA CCCGCGTCCA GCTCTACGAC TGCAACGGCA CCGGCGCGCA ACACTGGCAA GTACGCGCCG ACGGCTCCGT CCTGAACGCC ACCTCGGGAC GGTGCCTCGA CGCGAACGGC AGCGCCAACG GCTCCTACCT GCAAATCTGG GACTGCGCGG GTAGCGGCAA CCAGCACTGG ACGGTCAACG GCAGCAGCGG CGGAGGCGGT GACAACGCCT TCTGGTCGCA GATCCACGCC GGCTGGAACC TCGGCAACTC CTTCGACGCG ACCCCCTGCG AGACCTGCTG GGGCAACCCG GCGACGACGA AGCCGATGAT CGACCTGATC GCCACCAAGT TCAACTACCT GCGCATCCCG GTGACCTGGT ACCCGCACAT GACCAGCGGA GCGCCGAACT ACACCGTCGA CCCCGCCTTC TTCGCGCGCC TGAAGCAGGT CGTGGACTGG GCCATCGCCG ACAACATGTA CGTGGACGTC AACGTCCATC ACGACGGCGG CGGCGGCAAC TGGCTCACCC CGAGCACCGG AGCCATGGGC ACGACCGAGC CGGAGTTCAC CGCGCTGTGG CGGCAGATCG CCACCTATTT CAACGGCGAG AGCGACCATC TGTTGTTCGA GGCGATGAAC GAGCCGCAGG ACGCCAACGG CGGCAACCGC TACGGCGGCG GCACGTCCGA CAACTGGGGA CCCATCAACA CCCTGAATCA GGACTTCGTG AACACGGTCC GCGCGACCGG CGGCGCCAAC GCCACCCGCT GGCTGATCGT GGTCCCCTAC GGCGCGAACG CCCAGACCGG CGCCGACAAC CTCGCCGTGC CCGCCGGGGC GAACATCGCG GTGTCGGTGC ACACGTACAA CCCCTGGGCC TTCTGCTCCA CCACCGCCCC GAACTACGTC ACCTGGGACG GCTCGATGAA CTACCTGCCC GACGGCGACG TGGACAACGC CAACCGGCTG TTCACCAGCC GCGGCATCCC GGTGATCTGG ACCGAGTACG GCGCCACCGT CAAGCCGTAC AACGGCGGCG ACAACTCGGC GCAGGTCGCG AACTTCGAAT CACACATCAC TTCCTACGCC GCCCAGCACG GCCAGAAGAC AGTGGTGTGG GACAACGGCT CGATCGGCGT CGGCGACGAT CAGTTCGGCC TCATGAACCG CAACTCAGTG CAATGGCAGC ACTCGAACAT CGTTAACGCG ATCCAGGCTG CGGCCGGCTG A
|
Protein sequence | MTQPTHQHPE RRRRHPWRAV AAWTAAAALI TATLSAIGGT ASAATGATGT ITGQQSGRCV DAQSAGTANG TVVQLYDCNG TGAQQWQVRS DGSVLNPNSG RCLDVTGAGT ANGTRVQLYD CNGTGAQHWQ VRADGSVLNA TSGRCLDANG SANGSYLQIW DCAGSGNQHW TVNGSSGGGG DNAFWSQIHA GWNLGNSFDA TPCETCWGNP ATTKPMIDLI ATKFNYLRIP VTWYPHMTSG APNYTVDPAF FARLKQVVDW AIADNMYVDV NVHHDGGGGN WLTPSTGAMG TTEPEFTALW RQIATYFNGE SDHLLFEAMN EPQDANGGNR YGGGTSDNWG PINTLNQDFV NTVRATGGAN ATRWLIVVPY GANAQTGADN LAVPAGANIA VSVHTYNPWA FCSTTAPNYV TWDGSMNYLP DGDVDNANRL FTSRGIPVIW TEYGATVKPY NGGDNSAQVA NFESHITSYA AQHGQKTVVW DNGSIGVGDD QFGLMNRNSV QWQHSNIVNA IQAAAG
|
| |