Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_7202 |
Symbol | |
ID | 8338570 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 8371904 |
End bp | 8373325 |
Gene Length | 1422 bp |
Protein Length | 473 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 644960283 |
Product | type 3a cellulose-binding domain protein |
Protein accession | YP_003117872 |
Protein GI | 256396308 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.601323 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCCACA CCCCCCGCCC CCGCCGCTCG CTCGCGGCCC TCACGGCCAC CGGCGTCGCC GGCGCCTTCG GCCTGACCGC GGCCGTCCTG TCCACCTCGG CGCACGCCGC CGGCGCGCCG GCGGCGCTGG CCTACCACGT CGGCTCGTCC GGCTCGGACA TGGTCGAGCC CTGGTTCAAG GTCACCAACA CCGGCGGCTC GGCGCTGAGC CTGGCGACGG TGAAGATCCG CTACTACTTC ACGATCGACT CGGCGCCGTC CTACCAGTTC ACCTGCGAAT GGGCGCAGAT CGGCTGCGGG AACGTCACCG GCACCATCGG CACCCTGCCC TCCCCGAGCG CGACCGCGGA CCACTATCTG GAGGTCGGGT TCACCGGCGG CAGCCTGGCG CCCGGCGCCT CGACCGGCGA CCTGCAGCTG CGGATGAACG CCGCCGGCTG GGCACCGGTC AAGCAGAGCA ACGACTACTC GTTCAACGCC TCGCAGACCT CGTACGGCTC GACGGGCACC ATCACGCTCG CCGTCAACGG CGCGATCGTC TCCGGCACGG CGCCCGGCGG CAACCCGCCC CCCACCAGCC CGTCCAGCTC ACCGAGCACC AGCCCGTCGA CCTCGCCGAG CTCCGGCGGC GGCGGGTCGG CGTCCGGGAC GCTGTTCGAC GACTTCTCCT ACTCCGGTCC GTCCGACCCG AACCTGGCCG CCCACGGCTG GGCGATCCGC ACCGGCGCCG GCGGTCCGGG CGTGCAGAAC TCCTGGGCGG CGGACACCTT CAGCTTCCCC AGCGACTCCA GCGCGCAGGG CGGGCACGTC ATGAACCTGG CGGCCTCGAC CGACGGCACC ACCTCCGGGA CCAAGCAGGC CGAGATCGAC ACCACGCAGG AGAAGTTCTT CGAGGGCACC TACGCCGCGC GCGTCTACTT CAACGACGCC CCGACCACCG GCACCAACGG CGACCACGTG AACGAGACCT TCTACACGAT CACCCCGGAC AACTCGCTCT ACAGCGAGGA CGACTTCGAG TACCTGCCCA ACGGCGGCTG GGGCGGCCCG GCGGACTCGA TGTACACGAC CAGCTGGTAC AGCGCCGACG CGATGGACCG GGTCACCACC GACACCATGG GCAGCCTGCA GGGCTGGCAC ACGCTGGTGG CGACGGTCTA CGGCGGCACC GTCACGTACT ACATCGACGG CAAGCAGGTG TTCTCCAGCA CCGGCAAGTA CTACCCGCGT GAGGCGATGA CCATCGACTT CAACGAGTGG TTCATCGACC TGCCGTTCAC CGGTGCGCGC ACCTGGAACG AGAAGGTCAA CTGGGTCTAC TACGCCAAGG GCGTGGCCCA GTCGCCGACC GATGTGCAGA ACGCGGTCAA CGGGTTCTAC CAGGGCGGCA CGCACTTCAA GGACACCGTG CCGAGCAGCT GA
|
Protein sequence | MRHTPRPRRS LAALTATGVA GAFGLTAAVL STSAHAAGAP AALAYHVGSS GSDMVEPWFK VTNTGGSALS LATVKIRYYF TIDSAPSYQF TCEWAQIGCG NVTGTIGTLP SPSATADHYL EVGFTGGSLA PGASTGDLQL RMNAAGWAPV KQSNDYSFNA SQTSYGSTGT ITLAVNGAIV SGTAPGGNPP PTSPSSSPST SPSTSPSSGG GGSASGTLFD DFSYSGPSDP NLAAHGWAIR TGAGGPGVQN SWAADTFSFP SDSSAQGGHV MNLAASTDGT TSGTKQAEID TTQEKFFEGT YAARVYFNDA PTTGTNGDHV NETFYTITPD NSLYSEDDFE YLPNGGWGGP ADSMYTTSWY SADAMDRVTT DTMGSLQGWH TLVATVYGGT VTYYIDGKQV FSSTGKYYPR EAMTIDFNEW FIDLPFTGAR TWNEKVNWVY YAKGVAQSPT DVQNAVNGFY QGGTHFKDTV PSS
|
| |