Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_5224 |
Symbol | |
ID | 8336578 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 6012541 |
End bp | 6014241 |
Gene Length | 1701 bp |
Protein Length | 566 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 644958322 |
Product | Carbohydrate binding family 6 |
Protein accession | YP_003115924 |
Protein GI | 256394360 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.105995 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGAATC GCAGAAAGCT GCTGTCCTCC GCCGTCGCCC TCGGCGGCTC CGCACTGGGC CTGGGCACCG GCGCCCTGGC CTGGCGGGCT TCGGCGACCA CCCCCACCTT GCAGGTCGCC CTGCAGAACA CCACGACGTC GAACCAGGTC TACGCCTACG TCACGGGCCA GGCGATCGAC AACAACAATG CCCTGATGCT CTTGGAGGCC GACGGGCACA CGGTCTACTA CCCGACCTCG CCGAGTTCGA CCGGCTCCCC GCTGGCCGCG GACTGCGCGA TCCGGCTCGG CGCGCCGGGC AGCACCACCA CGATCACCAT CCCGCACATC GCCGGCGGCC GGATCTGGTT CGCCATCGGA GCGCCGCTGA CCTTCCTGCT CAACCCGGGA CCGGGTCTGG TCGAGCCCTC CGTGAGCAAC CAGTCGGACC CCAACATCAA CATCCGCTGG GACTTCTGCG AGTTCACGTA CAACGCCGCG CAGATGTTCG CCAACATCAG CTACGTCGAC TTCGTCTCGA TCCCGATCTC GCTGGCGCTG ACCAACGGCT CCGGCGCCAC GCAGACCGTC AGCGGCCTGC CGACCAACGG TCTGGACACG GTCTGCTCGA ACCTGAACGC CCAGCACGCC GCGGACGGCG CCGGCTGGAA CCAGCTGGTC GTCACCTCCG GCGGCGCCAA TTTGCGCGCG CTGAGCCCGA ACAACGGCAT CGTGATCAAC AACTCGCTGT TCTCGGGGTA CTACCAGCCC TACGTGGACC AGGTGTGGTC CAAATACTCG AGCCAGGCGC TGGCCGTGGA CACCCAGGCC TCCTGGGGCA CCGTCAACGG CCAGGTCTCC GGCGGGACGT TGACCTTCCC GGGGCTGGGA AGCTTCGCCA AACCCTCGGC CGCCGACATC TTCAGCTGCA GCACCGGGCC GTTCGCCAAC ACCGCCGGCG CGATGGGCCC GCTGGTCGCC CGCATCAGCG CCGCCTTCAA TCGCAGCACC CTGCTGATCG ACGCCACCCA GCCCGACGGC GAGAACCCCG CGAACTACTA CAAGAACGCG ATCACCAACC ACTACTCGCG GATCGCGCAC GCGGCGAACC TGGACAGCCG CGGCTACGGC TTCCCCTACG ACGACGTGGC GCCCAACGGC GGCGCCGACC AGTCCGGCGC GGTCTCCGAC GGCAACCCGA CGCTGCTGAC GGTGGCCGTC GGCGGCGGCA CGGCCACCGG CCCGGGCGGC GGCGGACCGA GTCAGCCGTC GAGCCCGAGC AGCACCGGCG GCGGAGGCGG CGGCACGGTC AGCGCCTTCA CCACGATCCA GGCGGCGAGC TACAGCTCGC ACAACGGCAC GCAGAACGAG ACCACCAGCG ACACCGGCGG CGGCCAGGAC GTCGGCTGGA TCGGAGGAGG CGACTGGCTC GCCTACGCCA ACGTCGACTT CGGCAGCGCG GGCGCGACGC AGTTCAAGGC CCGGGTCGCC TCCGGCGCCG CGGCAGGTGT CAGCGGTCTG ATCAAGGTCG CGCTGGACAG CCCGACGGCT GCGCCGGTCG GCAGCTTCGC CGTCGGCAAC ACCGGCGGCT GGCAGACCTG GCAGACCGTG CCGGCCAACA TCAGCAAGGT CACCGGAAAG CACACGGTCT ACCTGGTGTT CTCCAGCGGC CAGCCCGCCG ACTTCGTGAA CGTGCACTGG TTCACCTTCA GCCAGACCTG A
|
Protein sequence | MMNRRKLLSS AVALGGSALG LGTGALAWRA SATTPTLQVA LQNTTTSNQV YAYVTGQAID NNNALMLLEA DGHTVYYPTS PSSTGSPLAA DCAIRLGAPG STTTITIPHI AGGRIWFAIG APLTFLLNPG PGLVEPSVSN QSDPNINIRW DFCEFTYNAA QMFANISYVD FVSIPISLAL TNGSGATQTV SGLPTNGLDT VCSNLNAQHA ADGAGWNQLV VTSGGANLRA LSPNNGIVIN NSLFSGYYQP YVDQVWSKYS SQALAVDTQA SWGTVNGQVS GGTLTFPGLG SFAKPSAADI FSCSTGPFAN TAGAMGPLVA RISAAFNRST LLIDATQPDG ENPANYYKNA ITNHYSRIAH AANLDSRGYG FPYDDVAPNG GADQSGAVSD GNPTLLTVAV GGGTATGPGG GGPSQPSSPS STGGGGGGTV SAFTTIQAAS YSSHNGTQNE TTSDTGGGQD VGWIGGGDWL AYANVDFGSA GATQFKARVA SGAAAGVSGL IKVALDSPTA APVGSFAVGN TGGWQTWQTV PANISKVTGK HTVYLVFSSG QPADFVNVHW FTFSQT
|
| |