Gene Information |
Locus tag | Caul_1291 |
Symbol | |
ID | 5898746 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 1360330 |
End bp | 1361994 |
Gene Length | 1665 bp |
Protein Length | 554 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641561776 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001682919 |
Protein GI | 167645256 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1874] Beta-galactosidase |
TIGRFAM ID | |
|
|
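The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the reported gene span includes the stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues for an unspliced CDS whose span includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(1360330, 1361994)
print(length)                     # 1665
print(protein_length_aa(length))  # 554
```

Both results agree with the Gene Length (1665 bp) and Protein Length (554 aa) fields.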
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.186796 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0579695 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAGGAACC AAGATAGAGT TCGACCGGGA CGGATCCGGT CCTTGGCGAT GATCGCGGCG CTGCTCGCCG CCACCGCCTC GGCCGGCATG GGGGCCTCCA CTCCAGCCGC CGCCGCCGAC CTGCCCAAGT TCGTCGCCAA GGACGGTCGC CACGCCCTGA TGGTCGATGG GGCGCCGTTC CTGATGCTGG GCGTGCAGGT CAACAATTCC AGCAACTACC CTTCGCAACT GCCCAAGGTC TGGCCGGCGG TGAAGGCGCT GCAGGCCAAC ACCGTCGAGG TCCCGATCGC CTGGGAGCAG ATCGAGCCGG TCGAGGGCAG GTTCGACTTC TCGTTCCTCG ACGTGCTGCT CAAGCAGGCC CGCGAGAACG ACGTCAAGCT GGTGTTGCTG TGGTTCGGGA CGTGGAAGAA CAACGCCCCC AACTACGCGC CCGAGTGGGT CAAGCTGGAC AATACGCGCT TTCCGCGGGT GGTCACCGCC AAGGGCGAGA CCCGCAACTC GCTGTCGCCG CACTTCCCCG CCACCCTGGA GGCCGACAAG AAGGCCTTCG TGCAGTTGAT GCGCCACCTG AAGGCCGCCG ATCCGGACCA CACGGTGATC CTGGTCCAGC CCGAGAACGA GACGGGCGTC TACAGCGCAG TTCGCGACTA TTCCCCCGCC GCCCAGAAAC TGTTCGAGGG TCCTGTTCCG GCCGAACTGG TCAAGGCGAT GGGCAAGACG CCAGGAACCT GGAGCCAGGT GTTCGGCAAG GACGCCGACG AATATTTCCA CGCCTGGTCG ATCGGCCGCT ACGTCGATCA GATCGCCGCC GCCGGCAAGC GCGAGCTGGC GCTGCCGATG TATGTCAACG CCGCCCTGCG CGATCCCTTC AAGGACCAGG ACCCCTACAC CTACTCGTCG GGCGGACCGA CCTGGAACGT GCTCGACGTC TGGAAGGCGG CGGCGCCGTC GATCGACGCC ATCGCGCCGG ACATCTACAT GCGCGAGAGC AGCAATGTCC GAAAGACGCT GGCGCAGTAC GGCCGGCCCG ACAATCCGCT GTTCGTGCCG GAGATCGGCG ACGACAAGGG GTTCGCGCGC TACTTCTACG ATGTGCTGGG CGCCCACGGC CTGGGCTTCT CACCGTTCGG CCTGGATCAG ACCGGCTATT CCAACTACCC GCTCGGCGCC AAGACGGTCG ACGCCCAGGC GCTGGAGACC TTCGCGGTTC ACTACCGCCT GCTGGCCCCG ATGGCTCGCC AATGGGCCAG GCTTTCCTAC GAGGGCAAGG TCTGGGGCGC CGGCGAGGCG GATGACCGCA AGGCCGAGAC CCTGAAGCTG GGCGATCGCT GGACCGCCAC CCTGTCCTAC GGCGAATGGC AGTTCGGCTC GATCGAGGCC CCCTGGATGG CCAAGGCCGA AAAACAACCC AATCGCGAGG TCCCCGACGG CGGCGCCCTG ATCGCCCAGC TGTCACCCAA TGAGTTCCTG ATCACCGGCT ACCGCGCCCG TGTCAGCTTC GGTTCGGCCA AGGGCGAGCG GATGCTGATG GCGCGGGTCG AGGAAGGCCA TTTCGAGAAC GGCCAGTGGG TCTTCGATCG CCTGTGGAAT GGCGACCAGA CCGACTACGG CCTGAACCTG ACGACCTTGC CGCAAGTGCT GAAGGTCAAG CTGGCCACGT ATTAG
|
Protein sequence | MRNQDRVRPG RIRSLAMIAA LLAATASAGM GASTPAAAAD LPKFVAKDGR HALMVDGAPF LMLGVQVNNS SNYPSQLPKV WPAVKALQAN TVEVPIAWEQ IEPVEGRFDF SFLDVLLKQA RENDVKLVLL WFGTWKNNAP NYAPEWVKLD NTRFPRVVTA KGETRNSLSP HFPATLEADK KAFVQLMRHL KAADPDHTVI LVQPENETGV YSAVRDYSPA AQKLFEGPVP AELVKAMGKT PGTWSQVFGK DADEYFHAWS IGRYVDQIAA AGKRELALPM YVNAALRDPF KDQDPYTYSS GGPTWNVLDV WKAAAPSIDA IAPDIYMRES SNVRKTLAQY GRPDNPLFVP EIGDDKGFAR YFYDVLGAHG LGFSPFGLDQ TGYSNYPLGA KTVDAQALET FAVHYRLLAP MARQWARLSY EGKVWGAGEA DDRKAETLKL GDRWTATLSY GEWQFGSIEA PWMAKAEKQP NREVPDGGAL IAQLSPNEFL ITGYRARVSF GSAKGERMLM ARVEEGHFEN GQWVFDRLWN GDQTDYGLNL TTLPQVLKVK LATY
|
| |
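The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines of Python. This is an illustrative sketch, not the database's own method. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard code, and that an alternative start codon such as TTG (the first codon of this gene) is read as methionine when it initiates translation. The set of start codons listed in the code reflects my understanding of table 11 and should be checked against the NCBI genetic code tables. All function and constant names are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (assumed identical for
# table 11 and the standard code apart from the permitted start codons).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: initiation codons permitted by translation table 11.
START_CODONS = {"ATG", "TTG", "CTG", "GTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces between 10-letter blocks and normalize case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate a CDS on the + strand, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        # An initiating alternative start codon is read as methionine.
        if i == 0 and codon in START_CODONS:
            aa = "M"
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-letter blocks of the gene sequence in this record.
fragment = "TTGAGGAACC AAGATAGAGT TCGACCGGGA"
print(translate_cds(fragment))  # MRNQDRVRPG
```

The printed residues match the first block of the protein sequence in this record. Passing the full gene sequence to `gc_percent` gives a value that can be compared with the GC content field (66%).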