Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_5258 |
Symbol | |
ID | 8336612 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 6054139 |
End bp | 6056100 |
Gene Length | 1962 bp |
Protein Length | 653 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 644958356 |
Product | glycoside hydrolase family 9 |
Protein accession | YP_003115958 |
Protein GI | 256394394 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.35373 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTTTCCA GGGCACGACC GCGTTTCCGC CACCGCAGGA CGGTCGCCGC GCTCAGCGCC GTCAGCCTGG CCGGCCTGAC GGTCGGCACC GCCACCGTGC TCTCGCAGCC CGCGCAGGCC GCGCAGTCCG CGAACACCGG GCTGGTCCGC GTCGACCAGG CCGGGTACCT GGCCGGGGAC GTGAAGCAGG CGTACCTGAT GACCGGCGGG GCGGTGTCTG GGGCGAAGTT CTCGGTGCTG AACGCCAAGG GCAAGACGGT ACTCACCGGC AAGGTCGGCG GCACCAGCCT CGGCAAGTGG AACGCCGCCT ATCCGGACGT CTACCCGATC GTCTTCAGCG GCTTGAAGAC GCCGGGGACC TACCACATCG CGGTCGCCGG GAGCGCCTCG GGCAGCTCGC CGACGTTCAC CGTCACCAGC TCCGGCTCGC TCTACGGCAA GCTGGTCACC GACGGCGTCA CCTTCTTCCA GACCCAGCGC GACGGCTCGA ACGTCGTCCC CGGCGCCCTG AACCGCAAGC CCTCGCACCT GAACGACGCC GCGGCGAGCC TCTACGCCTG GCCGACCTTC GCCCCCGACG ACTCGGACAC CATCACCGAC GCCGACCTGA CCAAACTCGG CGGTACCGTC GACGTCTCCG GCGGCTGGTT CGACGCCGGC GACTACTTGA AGTTCTCCAA CAACGAGGCC TTCGGCGACA TCACGCTGCT GGCCGCGCAG CGCGCCCTGG GCTCCTCCGC CCCGGCCTCG CTGACCGCCG AGGCGCACTA CGGCGAGACC TGGCTGAACA AGGCCTGGAA CCAGAAGACG AAGACGCTGG TCTTGCAGGT CGGCATCGGC TCGGGCAATG CCGCCGGTAC TTTCACCGGC GATCACGACC TGTGGCGCCT GCCGCAGAAG GACGACGGCG ACACCGCCAC CGCCGACCGC TACTCCGCCG CGCACCGCCC GGCGTTCCTC GCCGCCAGTC CGGGGGCGAG GATTAGCCCG AACATCGCCG GGCGCGTGGC GGCGGCGTTC GCCCTGGCCG CGCAGGTCGA CGCGAAGAGC AACCCCAAGC AGGCCGCCGC CGAGTACCAG GCTGCCGCCT CGGTGTATGC GCAGGCTGAT ACCAGCGCTC CGCCGAGCCC GCTGACCACC GCGCTGCCGA ACGGCTACTA CCCCGAGTCG ATCTGGCACG ACGCGATGGA GTTGGGCGGC GCCGAACTGG CACTGGCCGC GCAGAAGCTG GGACACAGCC CTTCTTCGTA CCTGTCGCAG GCCGCTACTT ACGCCAAGGA CTACATCGCC TCCGACACCG GCGACACGTT CAACCTCTAC GACAACAGTG CCCTGGCACA CGCCGACCTG ATCAAGGCGA TCGCCGCCGC CGGCAACCCG TCGGGGCTGG CGGTCACTCG TGCCGCACTG ACCGCGGACC TGAAGCGGCA GGTGCAGTCG GCGGCGAGCA AGGCCTCCTC CGACGTCTTC CACGCCGGCG GCGACTACGC GGACTTCGAC GTCAACGCGC ACACCTTCGG CTTCCTGACC GAGGAGGCGC TGTACCGGCA GGCCAGCGGC GACACCTCGT TCCAGTCCTT CGCCACCGAA CAGCGCGACT GGCTGCTGGG CGCCAACGCC TGGGGACAGG CGTTCATGGT GGGAGAGGGC AGCACCTTCC CGAAGTGCAT GCAGCACCAG GTCGCGAACC TGTCCGGCAG CCTGAACGGC ACCGGCGCGA TCGCCACCGG CGCGGTGATG AACGGCCCGA ACAACACCAG CAACTTCGAC GGCGGCCTCG GCTCCTACCA GGACGGCATG AAGCCCTGCC CGCCCGGCGG CACTGACCCC GACACCAAGT TCACCGGCCA CAACAGCCGC TTCTCCGACG ACGTCCGCTC CTGGCAGACC GACGAGCCGG CCCTGGACAT GACCGGCTCG GCAGTCCTCG GCGCCGCGAT GCAGGAGACC CTCGGCGGCT GA
|
Protein sequence | MVSRARPRFR HRRTVAALSA VSLAGLTVGT ATVLSQPAQA AQSANTGLVR VDQAGYLAGD VKQAYLMTGG AVSGAKFSVL NAKGKTVLTG KVGGTSLGKW NAAYPDVYPI VFSGLKTPGT YHIAVAGSAS GSSPTFTVTS SGSLYGKLVT DGVTFFQTQR DGSNVVPGAL NRKPSHLNDA AASLYAWPTF APDDSDTITD ADLTKLGGTV DVSGGWFDAG DYLKFSNNEA FGDITLLAAQ RALGSSAPAS LTAEAHYGET WLNKAWNQKT KTLVLQVGIG SGNAAGTFTG DHDLWRLPQK DDGDTATADR YSAAHRPAFL AASPGARISP NIAGRVAAAF ALAAQVDAKS NPKQAAAEYQ AAASVYAQAD TSAPPSPLTT ALPNGYYPES IWHDAMELGG AELALAAQKL GHSPSSYLSQ AATYAKDYIA SDTGDTFNLY DNSALAHADL IKAIAAAGNP SGLAVTRAAL TADLKRQVQS AASKASSDVF HAGGDYADFD VNAHTFGFLT EEALYRQASG DTSFQSFATE QRDWLLGANA WGQAFMVGEG STFPKCMQHQ VANLSGSLNG TGAIATGAVM NGPNNTSNFD GGLGSYQDGM KPCPPGGTDP DTKFTGHNSR FSDDVRSWQT DEPALDMTGS AVLGAAMQET LGG
|
| |