Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_1302 |
Symbol | |
ID | 5898757 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 1374661 |
End bp | 1376283 |
Gene Length | 1623 bp |
Protein Length | 540 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641561787 |
Product | glycoside hydrolase family alpha-L-fucosidase |
Protein accession | YP_001682930 |
Protein GI | 167645267 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3669] Alpha-L-fucosidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.186001 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCTCA ATCGCCGCGA CCTTCTGAAC CTCGGGGCCG GCCTGGCCGC CGCCTCGGCC ACGCCCGCCT TCGCCGACGC CAGACAAGCC TTCCAGCCGA GTTGGGAGTC GCTGGCCGAG GGCTACAAGA CCCCTGATTG GTTCCGCGAC GCCAAGCTCG GCGTCTGGTC GCACTGGGGT CCGCAATGCG TGCCCGAATA CGGCGACTGG TACGGCCGCC AGATGTATAT CCAGGGCAAC GGCGTCTACG AGCACCACGT CAAGACCTAC GGCCACCCGA CGACGTTCGG CTTCATGGAG CTGATCAACC GCTGGAAGGC CGAGCGCTGG GATCCCGAAG GCCTGATGAA GACGTACCAG GCCGCCGGGG CCAAGTACTT CATGTCGATG GCCAACCACC ACGACAACCT CGACATGTTC GCCAGCCGCC ACCACGCGTG GAATACGCTG CGCGTCGGTC CAGGGCGGGA TATTGTCGGG ACCTGGGAGA AGGTCGCCCG CGCCCACGGC ATGCGGTTCG GCGTCTCCAA CCATTCGGCC CATGCCTGGC ACTGGTGGCA GACGGCCTAT GGCTACGACG CCGAGGGGCC GCTGAAGGGC CAGCGCTACG ACGCCTATCG GCTGACCAAG GCCGACGGCC AGGGCAAGTG GTGGCAGGGC CTGGATCCTC AGGAGCTCTA TACCGGCCGC AACATGGTCA TCCCCGATGG GATGGGCGGC ATCAAGGACG CCAACGCCTG GCACGACGCC CATGACGGCG AGTGGATCGA GACCGCGCCG CCGAACAATC CAGGCTTCAC GGCCAGCTGG CTGGCTCGCC AAAACGACCT GGTGGAGCGC TACAGGCCCG ACCTGGTCTA TTTCGACAAC TACACCCTGC CGCTGGGCCA GGCGGGCCTT GCCGCCACGG CCCACTATTA CAACCAGGCG CGCGCCTGGC GCGGCGCCAG CGACGTGGTG GTGACCGGCA AGAAGCTCAA CGCTCTCCAA CGCCGGGGCA TTGTCGAGGA CGTCGAGCGC GGCTTCTCCG ACCGTCTACG GCCCGAGCCC TGGCAGACCG ACACCTGCAT CGGCAACTGG CACTATGACC GCGGCCTCTA CGACCGCGAC GGCTACAAGA GCGCCAAGGA CGTGATCCAG CGGCTGATCG ACGTGGTCAG CAAGAACGGC TGCCTTCTGG TCTCCATCCC CCAGCGCGGC GACGGAACGA TCGACGACAA GGAAGAAAAG GTGCTGGCGG GCATGGCCGG CTGGATCGCC GTCAACGGTC CGGCGATCTA CGCCTCGCGG CCGTGGAGGA TCTACGGCGA GGGTCCGACC CGGCTGGTCG AGGGGATGCA GAACGAAGGC GACGCCAAGC CGTTCGAGGC GGCCGACATC CGGTTCACGA CCCGGGGCGG TGACCTGTTC GCGCTGCCGA TGGCGTGGCC GGCGGGCGAA CTGGTGATCG AGAGCCTGGC GACCTCCGGC CCGACAAGCG CTGGCGAGGT GCGGCGGGTG GAACTTCTGG GCGGCGGGGA ACTGGCTTTC GTCCGCGACG GCAAGGGCTT GCGGGTGTCG ATGCCGGATC AGCGGCCGAC GTTCACGCCG GTGGTGAGGA TTTCGGGACG GGGTTTGGTG TAA
|
Protein sequence | MSLNRRDLLN LGAGLAAASA TPAFADARQA FQPSWESLAE GYKTPDWFRD AKLGVWSHWG PQCVPEYGDW YGRQMYIQGN GVYEHHVKTY GHPTTFGFME LINRWKAERW DPEGLMKTYQ AAGAKYFMSM ANHHDNLDMF ASRHHAWNTL RVGPGRDIVG TWEKVARAHG MRFGVSNHSA HAWHWWQTAY GYDAEGPLKG QRYDAYRLTK ADGQGKWWQG LDPQELYTGR NMVIPDGMGG IKDANAWHDA HDGEWIETAP PNNPGFTASW LARQNDLVER YRPDLVYFDN YTLPLGQAGL AATAHYYNQA RAWRGASDVV VTGKKLNALQ RRGIVEDVER GFSDRLRPEP WQTDTCIGNW HYDRGLYDRD GYKSAKDVIQ RLIDVVSKNG CLLVSIPQRG DGTIDDKEEK VLAGMAGWIA VNGPAIYASR PWRIYGEGPT RLVEGMQNEG DAKPFEAADI RFTTRGGDLF ALPMAWPAGE LVIESLATSG PTSAGEVRRV ELLGGGELAF VRDGKGLRVS MPDQRPTFTP VVRISGRGLV
|
| |