Gene Information |
Locus tag | Caul_4005 |
Symbol | |
ID | 5901467 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 4335344 |
End bp | 4336657 |
Gene Length | 1314 bp |
Protein Length | 437 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641564526 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001685628 |
Protein GI | 167647965 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
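The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single untranslated stop codon. Neither convention is stated in the record, but both are consistent with the reported 1314 bp and 437 aa. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 4335344
end_bp = 4336657

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated,
# so the protein has one residue fewer than the number of codons.
codons, remainder = divmod(gene_length_bp, 3)
protein_length_aa = codons - 1

print(gene_length_bp, remainder, protein_length_aa)  # prints: 1314 0 437
```

A remainder of 0 indicates the gene length is a whole number of codons, as expected for an unspliced CDS.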
| |
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGACGA CGCGTAGGAC CGTGCTGGCG GGCGTGACGG CGGCGACGGC GATCGGGGCG GCTCCGGCCT TGGCCGCGCC GCGCCGCGCG ATGCCGAAGG GGTTCCTCTG GGGCGCGGCG ATCTCGGCGC ACCAGAGCGA GGGCAACGAC GTCAATTCCG ACAGCTGGCT GCTCGAGACC CTGCCGGAGA CGGTCTACAA GGACCCCTCG GGCGACGCCT GCGACAGCTA TCATCGCTAT GAACAGGACT TCGCGATCGC CCGCGCGATC GGGCTGAACT GCTATCGGTT CGGGATCGAG TGGGCCAGGA TCGAACCCGA ACCGGGCCGG TTCTCGCAAG CCGAACTGGA CCACTACAGG ACGGTCCTGA CCGCCTGCCG CGCGCACGGC CTGCTGCCGA TCGTCACCTA CAACCACTTC ACCGTGCCGC TGTGGTTCGC CATGCGCGGC GGCTGGGAGG CGCCCGACAG CGCCGACCTG TTCGCGCGGT TCTGCGAGCG GGCGACCCGC GCCCTGGGCG ACCTGATCGG CATGGCGTCG CCGTTCAACG AGGCCAACAT CCACCTGCTG GCCAGGATCA TGCGGATGGG CGCGACCCCC GAATACCTGG CCAAGCGGCG CGCGATGATC GCCGCCGCCG CGCGGGCGAC CAACGCGCCG GGGTTCTCGT CCATCCTGTT CGCCGACCCC GATCGCATCG ACGCCCACCT GCTCGACGCC CACGCCAAGG CCTACCAGGC GATCAAGGCG GGACCCGGCG ACTTTCCGGT CGGCGTCACC CTGACGACCC AGGCCGTCGA GGCGGTGGGC GAGGGCAGCA TCGCCCCTCG GATGGAGGCG ATGCTCTATG GCCAGTGGTG GGACGCGGTG AACGCCAGCG ACTTCGTCGG CGTGCAGACC TACACGCGGT TCCGGTTCGA CGTGAAGGGC GCCGCGCCGC CGCCGCCGGG GGCGGAGATG ACCGCCGCGG GCTACGAATA TTATCCGCGG GCGCTCGGCG ACACGATCCG CCTCGCCGCG CGCAAGACCA TCAAGCCCAT CTTCGTCACC GAAAGCGGCG TCGCCACCGA CGACGATACG CGCCGTGTCG CCTGGCTCGA CGCCAGCGTC GCCGAGATCG AGCGCTGCCT GGGCGAGGGG ATCGACGTCA AAAGCTATAT CTACTGGTCG CTGCTCGACA ATTTCGAGTG GACGCAGGGG TATGGCCAGC ATTTCGGCCT GGTGGCCGTC GACCGCGACA CCTTTGTCCG GACGCCGAAA CCCAGCGCCC AGCACTTCGC GCGGCTCGTG CGCGGCTCTC GAAACCATGG ATAG
|
Protein sequence | MATTRRTVLA GVTAATAIGA APALAAPRRA MPKGFLWGAA ISAHQSEGND VNSDSWLLET LPETVYKDPS GDACDSYHRY EQDFAIARAI GLNCYRFGIE WARIEPEPGR FSQAELDHYR TVLTACRAHG LLPIVTYNHF TVPLWFAMRG GWEAPDSADL FARFCERATR ALGDLIGMAS PFNEANIHLL ARIMRMGATP EYLAKRRAMI AAAARATNAP GFSSILFADP DRIDAHLLDA HAKAYQAIKA GPGDFPVGVT LTTQAVEAVG EGSIAPRMEA MLYGQWWDAV NASDFVGVQT YTRFRFDVKG AAPPPPGAEM TAAGYEYYPR ALGDTIRLAA RKTIKPIFVT ESGVATDDDT RRVAWLDASV AEIERCLGEG IDVKSYIYWS LLDNFEWTQG YGQHFGLVAV DRDTFVRTPK PSAQHFARLV RGSRNHG
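The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The following is a minimal sketch of how the GC content field and the reading frame could be checked against the gene sequence. The helper names (`clean_sequence`, `gc_fraction`, `ends_with_stop_in_frame`) are hypothetical and not part of any database tooling. The stop codon set is the standard one used by translation table 11.

```python
def clean_sequence(raw: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# Stop codons of translation table 11 (bacterial, archaeal, plant plastid).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def ends_with_stop_in_frame(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# First two blocks of the gene sequence above, used as a short example.
first_blocks = clean_sequence("ATGGCGACGA CGCGTAGGAC")
print(len(first_blocks), gc_fraction(first_blocks))

# Toy CDS (hypothetical, not from the record) to show the frame check.
print(ends_with_stop_in_frame("ATGGCGACGTAG"))
```

Passing the full gene sequence through `clean_sequence` and then `gc_fraction` gives a value to compare with the 69% reported in the record. Because the gene sequence already begins with ATG, it appears to be given in coding orientation even though the gene lies on the minus strand, so no reverse complement step is included here.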