Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_1283 |
Symbol | |
ID | 5898738 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 1348404 |
End bp | 1349402 |
Gene Length | 999 bp |
Protein Length | 332 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641561768 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001682911 |
Protein GI | 167645248 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2730] Endoglucanase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.00987671 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCTCTCGA TCCTGAAGAC CCTGAGCGCC TGCGTCCTGC TGTCGGCTCT GGCCCTGCCG GCCGCCGCCC GCGAGTTCGC GCCGCAAGTC CAGGTCAAGG CCATGACCCG GGGCGTCAAC GTCCTCGGCT ACGATCCGAT CTGGAAGGAC CCCGCCCAGG CCCGCTTCCA GATGCGCCAC TTCAAGACCA TCAAGGACGG CGGCTTTGAC GCGGTGCGGA TCAACCTGCA CGCCTTCTCG CACATGGACT CGGGCAACCG CCTCGACCCG GCCTGGCTGA AAACCCTGGA CCAGGTGGTC GAGGCCGCCC TGGCCCAGAA GCTGACCGTC ATCCTCGACG AGCACGATTT CGGCGACTGC GGCGCGGATC CCGCCGCCTG CAAGCCCAAG CTGGTCGCCT TCTGGGGGCA GATCGGCGCG CGCTACAAGG ACGCCCCGGA CGGCGTGGTG TTCGAACTGC TCAACGAGCC GAACAAGGCG CTGACCGACG ACCTCTGGAA CGGCTGGATC GTCGAGCTGC TGGCGGTCGT CCGCGCCGAC AACCCGACCC GCAACGTCAT CGTCGGCCCC GCCTTCTGGA ACAACATCAG CCACCTGGAC CAGCTGAAGC TGCCGGAAAA CGACCGCCAC CTGATCGCCA CCGTGCACTA CTACCTGCCG ATGGAGTTCA CCCACCAGGG TGCGTCGTGG AATCCCGACG CGGCCAAGCT GGGCGTGACC TGGGGGACGG CCGCCGAACG CCGGCGGATG AAGGCCGATT TCGACAACGT CCAGGCCTGG GCCAAGAGCC ATGACCGGCC GATGCTGCTG GGCGAGTTCG GAGCCTATGA CAAGGGCGAC ATGATCTCGC GCGCCGCCTA CACCGCCGCC GCCGCCCGCG AGGCCGAAGC CCGCGGCTGG GCCTGGGCCT ATTGGCAATT CGACAGCGAC TTCATCGTCT ACGACATCAA GACGGACGGC TGGGTGGCGC CGATCCACGA GGCGCTGGTT CCGAAATAG
|
Protein sequence | MLSILKTLSA CVLLSALALP AAAREFAPQV QVKAMTRGVN VLGYDPIWKD PAQARFQMRH FKTIKDGGFD AVRINLHAFS HMDSGNRLDP AWLKTLDQVV EAALAQKLTV ILDEHDFGDC GADPAACKPK LVAFWGQIGA RYKDAPDGVV FELLNEPNKA LTDDLWNGWI VELLAVVRAD NPTRNVIVGP AFWNNISHLD QLKLPENDRH LIATVHYYLP MEFTHQGASW NPDAAKLGVT WGTAAERRRM KADFDNVQAW AKSHDRPMLL GEFGAYDKGD MISRAAYTAA AAREAEARGW AWAYWQFDSD FIVYDIKTDG WVAPIHEALV PK
|
| |