Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_3277 |
Symbol | |
ID | 5900732 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 3539579 |
End bp | 3542503 |
Gene Length | 2925 bp |
Protein Length | 974 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641563783 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001684902 |
Protein GI | 167647239 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1501] Alpha-glucosidases, family 31 of glycosyl hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGGCGT CTTGGACGAT GACGGCGGCG GCGATCGCCC TGTTCGCGGC CGCCCCCGCG TTCGCGGGGG GGCCGTCTCC GCCCACCGGT TTTGACCCTC TGGGCCAACT GCCGGCCCTG GCGCCCGCGA CTGGCGCGGC GTTCCGCCTC CAGCAGCTGC GCAACGGCGT TCAGTTCCAG TTGAACGGCG TCACCAAGTC GGTGCTCTTC TATTCGCCGA CGGTCGTGCG GGTGAACGCC AACCTGGGTC AATCCTACTG GAGCGCGCCA AGCCTCGTGG TGATCCGCCG ACCCGAGAGC GTTCCTTTCA CCGTGCGCGA GACCCCGGCC GCGGTGGAGC TGGTCGGCCA GAAGCTGCGC GTCGTCATCG ACAAGCACGC GGGCGCCTTG ACCTATTACG ACGCCGCGGG TCGGCTGCTC ACCCGCGAGA AGGCCGACGC GCCCCAGTCG ATCGACAAGC GCACCATCGC CGAGGCGCCG ACCTACGAGG TGGAGAACCG CTTCACGCTC AAGCCGGGAG AAGGCGTTTA CGGCTTTGGC TTCACCGGCG ACGACGAGGT CAACCGCCGA AACAAGGACT TGCTGCTGGT CCAGACCAAC GTCGGGATCA TCATTCCGGT GATGGTCTCG ACCGAGCGCT ACGGCGTGCT CTGGGACACC TATTCGCAGA TGCGCTTCAA AGACGACGAC CAGAGCGCGC GTCTCTGGGC GGAAAGCGCG CCGGGCGGGG TCGACTACTA CTTCATGGCC GGCGACACGA TGGACGGCGT CGTCGGCGCC TATCGCTGGC TGACGGGCGA TGCGCCGATG TACCCCAAGC AGGCCTTCGG CTTGTTCATG AGCAAGGAGC GCTATCCCAC CGAGGAGCGT CTCTTGGAGG TCGCCAGAAC CTTCCGCAAG GAGAGCTTCC CGCTCGATTA CATCGTCCAG GACTGGCAAT ACTGGGGCGG CGACGAGGAC GGAACCTGGT CGGGCATGAC CTGGGATGCC AAGCGGTTTC CCGATCCGGT CGGTATGACG CGGCAGGTCC ACGACCTTCA CATGAAGCTG ATGGTGTCGA TCTGGCCGTC GATTGGCAAC GACACCGCGC TCGGGCGCGA ACTGGACCAA CACGACCTGC GCTTCAAGCC CCTGCACTGG ATCTCCAAGC AGGCCCGCGT CTACGACGCC TACAGCCCGC TGGGCCGAGA AATCTATTTC AAGCACATCA AGTCCGGCCT GCTCGACAAG GGTGTCGACG CGCTATGGAT GGACGGGACC GAGGTCGAGG TCAGCTCGGC GATGTGGAAT CCGGCCGACA ACATCCGCGA CACCAAGGCG CTGGGCGCCA ACGCGATGGG TGACTTCACC CGGTACCTGA ACCCGTATTC GCTGCTGACC ACCGAAGGGA CCTACAAGGG CCAGCGCGCC ACCAGCGACA AGCGGGTCTT CACCCTGACG CGATCGGCCT GGGCGGGCGC CCAACGCACG GCCGCCGCGT CTTGGTCAGG CGACATCTAC GCCAGTTGGA AGACCTTCGC CCAGCAGGTG GCGGGAGGCG TCAACGTCAC GATCACCGGC AACCCCTACT GGACCCAGGA CACCGGCGGC TTCTTCGTCT CCGACTTTCC AGGCGGCGAG AAGAACCCCG CTTGGCGCGA GCTCTATGCC CGCTGGCTCC AGTACGCGGC CTTCAACCCG ATCATGCGCA TCCACGGCAC GAGCGTCGAG CGCGAGCCCT ATCTGTTCAA GACCCTGGAT CCGCCGGTCT ACAAGGCGCT GCTGGACGCC ACCCGGCTTC GGTACCGGCT ACTGCCCTAT ATCTACAGTC TCAGCGCCAA GGTCACGGCC GATCGCTATA CCCTGATGCG CCCGCTGCCG ATGGACTTCC CCAAGGATCC GGCGACCTAC AACATCAATG ACAGCTTCAT GTTCGGCTCG TCACTGCTCG TCCATCCCGT GACGCGCGCG ATGTACAATA TCCTTCCGTC CTCGGCGACG ACGGTTCCCG GGCGATATCT GCGGACCTCG GACGGCAAGC CCGGACTGAC CGTCCAATAT TTCGCCGGCG AGAACTTCGA GACGCCGCGC GGTCAGGCGG TCGACACGAA GATCGACCAC ACCTGGCCCG ATCCTCCGCT GGCGGAGCTT CCGCCCGGCC TGACCTCGCT GAGCCACTTC TCCGTCCAGT GGGACGGCGA TCTGATCGCG CCGGAGGATG GCGAATACGA AATCGGGCTG ACAGGCAACG ACGGCATGCG CATGGCGCTG GCCGGCGACA CGGCGATCGA CGAGTGGCGG CGCGCGGCGA CCCGCACGCG CGTGGTGCGG CGAACGCTCA AGGCTGGCGA AAAGCTGCCA GTCCGGCTGG AATTCTCCCA TCCCGAGGGC GGCCGCGTGT TCCGTTTCGT CTGGCGCACG CCCAGCGAGC TGAAGCGGGA CGCCGCCGCG ATGAACGCGC CGCGCGACCT CACGATGCGC ACCTACCTGC CCAAGGGCGC GGATTGGTAC GATTTCTGGA GTGGCCAGCG CCATCCCGGC GGTCAGATCG TCGCGCGCCA AGCTCCGCTC GACGTCATGC CGCTCTATGT CCGGGCCGGC GCGATCATCC CGATGGGGCC GGTCCTGCAG TACGCGACCC AGCACCCCGA TGCGCCCTAC GAGATCCGCG TCTATCCGGG CGCGGACGGC AAGTTCACGA TCTATGAGGA CGACAACGAA ACCTACGCCT ACGAGAAAGG TCAATCGGCG CGTTACGACC TCAGCTGGAA CGATGCGACG CGGACCTTGA CGGTGGGACC TCGGCGCGGC GCGTTCGCCG GCATGGTCAA GCAGCGAACG CTCAAGATCG TGGTGATCGG CGCCGCCGGG GCAAGCACGC TCACGCCGCC ATCGACCGAT CGTACAGTGT TGTATGGTGG ACGAGCCTTG ACCGTACGGT ACTGA
|
Protein sequence | MKASWTMTAA AIALFAAAPA FAGGPSPPTG FDPLGQLPAL APATGAAFRL QQLRNGVQFQ LNGVTKSVLF YSPTVVRVNA NLGQSYWSAP SLVVIRRPES VPFTVRETPA AVELVGQKLR VVIDKHAGAL TYYDAAGRLL TREKADAPQS IDKRTIAEAP TYEVENRFTL KPGEGVYGFG FTGDDEVNRR NKDLLLVQTN VGIIIPVMVS TERYGVLWDT YSQMRFKDDD QSARLWAESA PGGVDYYFMA GDTMDGVVGA YRWLTGDAPM YPKQAFGLFM SKERYPTEER LLEVARTFRK ESFPLDYIVQ DWQYWGGDED GTWSGMTWDA KRFPDPVGMT RQVHDLHMKL MVSIWPSIGN DTALGRELDQ HDLRFKPLHW ISKQARVYDA YSPLGREIYF KHIKSGLLDK GVDALWMDGT EVEVSSAMWN PADNIRDTKA LGANAMGDFT RYLNPYSLLT TEGTYKGQRA TSDKRVFTLT RSAWAGAQRT AAASWSGDIY ASWKTFAQQV AGGVNVTITG NPYWTQDTGG FFVSDFPGGE KNPAWRELYA RWLQYAAFNP IMRIHGTSVE REPYLFKTLD PPVYKALLDA TRLRYRLLPY IYSLSAKVTA DRYTLMRPLP MDFPKDPATY NINDSFMFGS SLLVHPVTRA MYNILPSSAT TVPGRYLRTS DGKPGLTVQY FAGENFETPR GQAVDTKIDH TWPDPPLAEL PPGLTSLSHF SVQWDGDLIA PEDGEYEIGL TGNDGMRMAL AGDTAIDEWR RAATRTRVVR RTLKAGEKLP VRLEFSHPEG GRVFRFVWRT PSELKRDAAA MNAPRDLTMR TYLPKGADWY DFWSGQRHPG GQIVARQAPL DVMPLYVRAG AIIPMGPVLQ YATQHPDAPY EIRVYPGADG KFTIYEDDNE TYAYEKGQSA RYDLSWNDAT RTLTVGPRRG AFAGMVKQRT LKIVVIGAAG ASTLTPPSTD RTVLYGGRAL TVRY
|
| |