Gene Information |
Locus tag | Caci_4932 |
Symbol | |
ID | 8336286 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 5627728 |
End bp | 5629719 |
Gene Length | 1992 bp |
Protein Length | 663 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 644958031 |
Product | Alpha-L-fucosidase |
Protein accession | YP_003115633 |
Protein GI | 256394069 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3669] Alpha-L-fucosidase |
TIGRFAM ID | [TIGR01586] cysteine protease domain, YopT-type |
|
|
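The coordinate and length fields in the table above are mutually consistent, and the arithmetic can be checked with a minimal sketch. The function names here are hypothetical. The sketch assumes the start and end positions are 1-based and inclusive, and that the reported protein length excludes the stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(5627728, 5629719)
print(length, protein_length(length))  # 1992 663
```

Both results match the Gene Length and Protein Length fields listed for Caci_4932.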
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00307055 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCTCCT CTCCTCACCC CATCAGCCGC CGCTCCCTCC TCGCCGCCGG AAGCGCCGGC ACAGCCGCCG CGTTCGGCCT GCTGCGTTTT GCCCCCTCCG CGTCCGCCAC CGTCGGCCCG CAGAGCTACA CGGCCGCCTG GCCGTCGGTG GATCAGCACC CGCCGGCGCC CGAGTGGTTC CAGGACGCCA AGTTCGGGAT CTACTACCAC TGGGGCGTGT TCTCCGTCCC CGCCTTCGGC AACGAGTGGT ACCCGCGCAA CATGTACAAC TCGGGCTCGG CCGAGAACCA GCACCACATC GCCACCTACG GGGACCCGTC GGTGTGGCCG TACCACAACT TCATCCTCGG GGCAAACGAC AAGTCGGGCC ACTTCACCAA GTTCGCGCCC AAGCTGACCT CCGCCGGCGG CAGCTGGGAT CCGAACGCCT GGGCCCAGCT GTTCAAGAAC GCCGGCGCGA GGTTCGCCGG TCCCGTGGCC GAGCACCACG ACGGCTACTC GATGTGGAAC AGCCAGGTCA ACGAGTGGAA CTCGGTGAAG ACCGGTCCGA ACCTGGACCT GGTGAACCTG CACGCCCAGG CGATCCGCGC GCAGGGGTTG AAGTTCATGA CCTCCCTGCA CCACGCCTAC CACTTCCAGG GCTACTACCA GTACGTCCCC CAGCAGTCGA CCGCCACGCT GCAGAAGCTG TACGGGCAGA CGGGGACCAC GGCAGAGAAC CAGCTCTGGT ACAACAAGCT GGTCGAGGTG ATCAACGGCT ACCAGCCGGA CCTGATCTGG CAGGACTTCG ACCTGAACGG CGTCCAGGAA TCCGAGCGCC TGGCCTTCCT GTCGTACTAC TACAACAAGG CCGTGTCCTG GAACAAGGAC GTCGTGGCCA CCTACAAGGA CGGCTTCGAC ACCTCCGGCG AGGTCTTCGA CTTCGAGCGC GGCGGTCCCG GCGGCCTGCT CACGCCGTAC TGGCTCACCG ACGACTCCAT CTCCTCCTCC AGCTGGTGCT ACACCGTCGG CATCGGCTAC TACACCGGCC AGGCACTGCT GCACGCGCTC ATCGACCGGG TGGCCAAGGG CGGCAACATG CTGCTGAACA TCGCCCCGAT GGCCGACGGC ACCATCCCCT CGGGCCAGCA GAGCCTCCTG CTCGGCATGG GCGACTGGCT CGGCCGCTTC GGCGAGGCGA TCTACGCCAC CCGCGCGTGG ACCTCCTACG GCGAGGGTCC CACCGCGATG GGCGGCGGGT CCTTCTCCGG CCCGAAGGCG GGCACGCCGC AGGACGTCCG CTTCACCCGC AGCAAGGACA ACACTGTCCT GTACGCCACC GCCCTCGGCT GGCAGGGCAG CACCATGACG GTCACCACGC TGAACTCCAA CCAGATCAAT CTCAGCACCC TGGTCTCCGC GCAGCTGCTG AACAACGCCG CGGGGACCTA CGTCAACCTG CCCTCCCCGG TCCAGGACTC CGGCGGCCTG CACTTCTCGA TGCCCTCGTC CAACGCGCCC TTCTCAGCCC TGGCCTACAC GGTCAAACTG ACCTTCTCCG GCCAGATCCC GACCCTGGGC GGCGGCGGAT CCGTCCCGAC CGGCTGGTCG AAGATCGTCA ACGGCGCCAC CGGCCTGGTC CTGGACAGCG GCGGCAGCGT CGCCTCCGGC TCGAACCTGA AGCAGTGGAA CTACGACGGC AGCACCAACC TGCAGTGGCA GCTCGTGCCG CTCGGCGGCG GCTACTACCG CATCGTCAAC AACACCAACG GCATGGTCGC CGACAGCTGG GGCAACACCG CCAACGGCGC ACCGGCCCGC 
CAATCCGCCT GGAACGGCGG CAACAACCAG CAATGGAGCC TCACCAGCGC CGGCAGCGGG CGCTACCACA TCGTCAACCG AGGCACCGGC ACCGCGCTGG ACGGCTCCGG CAGCACCACG GCGGGTTCCA CCGCCGTCCT GTGGAGCCCC AACTCCAGCC CCAACAACGC GTGGTCGATC GTCGCGATCT GA
|
Protein sequence | MSSSPHPISR RSLLAAGSAG TAAAFGLLRF APSASATVGP QSYTAAWPSV DQHPPAPEWF QDAKFGIYYH WGVFSVPAFG NEWYPRNMYN SGSAENQHHI ATYGDPSVWP YHNFILGAND KSGHFTKFAP KLTSAGGSWD PNAWAQLFKN AGARFAGPVA EHHDGYSMWN SQVNEWNSVK TGPNLDLVNL HAQAIRAQGL KFMTSLHHAY HFQGYYQYVP QQSTATLQKL YGQTGTTAEN QLWYNKLVEV INGYQPDLIW QDFDLNGVQE SERLAFLSYY YNKAVSWNKD VVATYKDGFD TSGEVFDFER GGPGGLLTPY WLTDDSISSS SWCYTVGIGY YTGQALLHAL IDRVAKGGNM LLNIAPMADG TIPSGQQSLL LGMGDWLGRF GEAIYATRAW TSYGEGPTAM GGGSFSGPKA GTPQDVRFTR SKDNTVLYAT ALGWQGSTMT VTTLNSNQIN LSTLVSAQLL NNAAGTYVNL PSPVQDSGGL HFSMPSSNAP FSALAYTVKL TFSGQIPTLG GGGSVPTGWS KIVNGATGLV LDSGGSVASG SNLKQWNYDG STNLQWQLVP LGGGYYRIVN NTNGMVADSW GNTANGAPAR QSAWNGGNNQ QWSLTSAGSG RYHIVNRGTG TALDGSGSTT AGSTAVLWSP NSSPNNAWSI VAI
|
| |
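The sequences above are printed in space-separated blocks of ten residues. As a rough sketch (helper names are hypothetical and not part of any database tool), the blocks can be joined and the GC percentage computed as follows. The example uses only the first two blocks of the gene sequence, so its result is not the whole-gene GC content reported in the table.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two ten-base blocks of the gene sequence shown above.
example = clean_sequence("ATGTCCTCCT CTCCTCACCC")
print(len(example), round(gc_percent(example)))  # 20 60
```

Passing the full gene sequence through `clean_sequence` before calling `gc_percent` would give a value comparable to the GC content field.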