Gene Information |
Locus tag | Caul_4086 |
Symbol | |
ID | 5901548 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 4431428 |
End bp | 4433761 |
Gene Length | 2334 bp |
Protein Length | 777 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641564606 |
Product | glucan 1,4-alpha-glucosidase |
Protein accession | YP_001685708 |
Protein GI | 167648045 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3387] Glucoamylase and related glycosyl hydrolases |
TIGRFAM ID | [TIGR01535] glucan 1,4-alpha-glucosidase |
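
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced for illustration only.

```python
# Values copied from the Gene Information table for Caul_4086.
START_BP = 4431428
END_BP = 4433761
GENE_LENGTH_BP = 2334
PROTEIN_LENGTH_AA = 777


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks pass for the values listed in the record.
    assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
    assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
    print("coordinates, gene length and protein length are consistent")
```

Under these assumptions the record is internally consistent: the span 4431428 to 4433761 covers 2334 bp, which is 778 codons, or 777 amino acids plus the stop codon.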
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCTGCC TGAAGATCCT GGCCCTGTCC TCGACCGCCG CCGTCGCCCT GGCGGGCGCG GCCCGCGCCG AAGCTCCGAC CTCCTGGGCC TATGCCGCCA AGACCGGGGT CGGCGCGTCC TACGAGGCCT ATGTCGACGG CGCCTACAAG GACGGCGGGC CGACCGGTCC GGTGTCGAAG GTCTGGTTCT CGATCGCCGA CGGAACCCTG ACCGAGACCA TGTACGGCCT GATCCACGAG GCCCAGATCA AGCAGATGCG CGTGGCGGTG AAGACCGCGA CCGGCCTGGC CGTCGAGGGC GCTGACACTA CATCGAAGAC CGAGTACCTG CACGTCGACG CCGCCGGCCG CCCGCTGTCG CCGGCCTACA AGGTGACCAC CACCGACAAG CAGGGCCGCT TCCAGATCGA GAAGCGGATC TTCACCGACC CCGACCACAA CAGTCTGTTC GTGCGGGTCA CCGTCCGCGC CCTGAAGGGG CCGATCACGC CGTTCCTCGT GCTGGAGCCC CACATGGCCA ACACCGGCGG CGGCGATGTC GGCTCGGCCG GCGGCGGGGC GCTGACGGCC CATGAGGGCA AGTTCTTCCT CAGCCTCAAG GGCCAGCGAT CGTTCGTCAA GGCGGCCGCC GCGCCGCTGA AGGACGGCGA CGCCCTGGCG ATCTTCAAGG ACGGCGCCTT GGTCGGCGCG GCCGAGGCCA AGGGCGCCAT CGTCCTGGCC GGCCAGCTTC CGACCCAGGC CTCGGGCGAG GCGACCTACG ACTTCGTCAT CGGCTTTGGC GACAGCATCG GCGCCGCCGA CAGGGCCGCG TCGGCCACGC TGAGCACGGG CTACGCCGAA GTGCTGGCCC GCTACAACGG CGAGGGCGAC CGCGTGGGCT GGGAGGACTA TCTGGCCTCG CTGACCGAGC TGCCGCGCCT GCGCAAGGCC TCGGAGGATG GCGGCAAGCT GGTCCAGGCC AGCGCCCTGA TGCTGAAGGT GCAGGAAGAT CGCACCTATG CCGGGGCCCT GATCGCCTCG CTGTCCAATC CCTGGGGCGA CACGGTGGAC GCCTCCAAGC CATCGACCGG CTACAAGGCC GTCTGGCCGC GCGACTTCTA CCAGTGCGCC ATGGCCCTGG CGGCCCTGGG CGACAAGCAG ACGCCGCTGG CCGCCTTCCA CTATCTGCCG CGGGTCCAGG TCAAGGCGAC CACGCCGGGC AATACCGGGG TCGGCGGCTG GTTCCTGCAG AAGACCTGGG TGGACGGCAC CCCCGAATGG GTCGGCGTCC AGCTGGACCA GACCGCCATG CCGATCATGC TGGGTTGGAA GCTGTGGAAG CTGGGCTGGC TGCCCGAGGC CGACCTGAAG ACCTACTATG GCAAGATGAT CAAGCCGGCC GCCGACTTCC TGGTCGATGG GGGCAAGGTC GGGGTTGGTT GGAACCACGA GACGATCAAG CCGCCCTTTA CCCAGCAGGA GCGCTGGGAA GAGCAGGGCG GCTATTCGCC CTCGACCACG GCGGCGACCA TCGCCGGCCT GGTGGTGGCG GGCGACATCG CCGAGCTGGC GGGCGACACG GACGGCGCGG CCCGCTACCA CGCCACGGCC GACGCCTATT CGGCCAAGGT CGAGGCCCGG ATGGTCACCA CCAAGGGACC GTTCGGCGAC GGGACCTACT ATGTGCGCCT CAACCAGAAC GAGGATCCCA ACGACCACGC CCCGATCGGC GCCGCCAACG GCCAGATCGC CCCGCCCAAG GACCAGGTGG TCGATGGCGG CTTCCTGGAG CTGGTCCGCT ACGGCGTGCG CCGGGCCGAC 
GATCCGGCCA TCGTCGGCAG CCTCCCGGAG CTGGACGACA CCACGCGGGC CGACCTCTAT CGCGTCCGTT ACGACTTCAC CTTCCCGAGC GTGAAGGGCG ACTATCCGGG CTGGCGGCGC TACGACGTCG ACGGCTATGG CGAGGACGCC AAGACCGGGG CCAACTACGG CGTGGGCGGC CAGATGAGCC CGGGCCAGCG CGGCCGGGTC TGGCCGATCT TCACCGGCGA ACGCGGCCAC TACGAGCTGG CGCTGGCCAG CTTGCACGGC AAGCCGAGCG CGGCGGCCGT GCGGCGGATC CGCGACCGCT ACGTCAAGGC CATGGAGCTG TTCGCCAATG ACGGCCTGCT GATTTCCGAA CAGGTCTGGG ACGGCGTCGG ACAAAACCCG CGCGGCTATG AACGCGGCGA GGGCACGGAC TCGGCCACCC CCCTGGCCTG GTCGCACGCC GAATACGTCA AGCTGCTGCG CTCGGTCAGC GACGGCGAGG TGTGGGACCG CTATGCGCCG GTGGCGGCGC GCTACGCGAA GTAG
|
Protein sequence | MRCLKILALS STAAVALAGA ARAEAPTSWA YAAKTGVGAS YEAYVDGAYK DGGPTGPVSK VWFSIADGTL TETMYGLIHE AQIKQMRVAV KTATGLAVEG ADTTSKTEYL HVDAAGRPLS PAYKVTTTDK QGRFQIEKRI FTDPDHNSLF VRVTVRALKG PITPFLVLEP HMANTGGGDV GSAGGGALTA HEGKFFLSLK GQRSFVKAAA APLKDGDALA IFKDGALVGA AEAKGAIVLA GQLPTQASGE ATYDFVIGFG DSIGAADRAA SATLSTGYAE VLARYNGEGD RVGWEDYLAS LTELPRLRKA SEDGGKLVQA SALMLKVQED RTYAGALIAS LSNPWGDTVD ASKPSTGYKA VWPRDFYQCA MALAALGDKQ TPLAAFHYLP RVQVKATTPG NTGVGGWFLQ KTWVDGTPEW VGVQLDQTAM PIMLGWKLWK LGWLPEADLK TYYGKMIKPA ADFLVDGGKV GVGWNHETIK PPFTQQERWE EQGGYSPSTT AATIAGLVVA GDIAELAGDT DGAARYHATA DAYSAKVEAR MVTTKGPFGD GTYYVRLNQN EDPNDHAPIG AANGQIAPPK DQVVDGGFLE LVRYGVRRAD DPAIVGSLPE LDDTTRADLY RVRYDFTFPS VKGDYPGWRR YDVDGYGEDA KTGANYGVGG QMSPGQRGRV WPIFTGERGH YELALASLHG KPSAAAVRRI RDRYVKAMEL FANDGLLISE QVWDGVGQNP RGYERGEGTD SATPLAWSHA EYVKLLRSVS DGEVWDRYAP VAARYAK