Gene Information |
Locus tag | Caul_2412 |
Symbol | |
ID | 5899867 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 2633003 |
End bp | 2634898 |
Gene Length | 1896 bp |
Protein Length | 631 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641562903 |
Product | Alpha-galactosidase |
Protein accession | YP_001684037 |
Protein GI | 167646374 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3345] Alpha-galactosidase |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.00268404 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.284448 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTTTTT TCCGTATCGC CGCCGCCGTG GCCGCCACGC TCGTCATGTC GGCGACGCAG GCCGTCGCAG ATCCCCTCGC GCCCGTCGCG CGCTGGACCG CCTACGAGCG CGCCGCGGCG CGCACCCCGC CGATGGGCTG GAACAGCTGG AACGCCTTCA CCAGCGACAT CGACGAAGAG AAGATCATGG GCTCGGCCCG GATCCTGGTG AAGGCCGGCC TGGCGGATCG GGGCTATCGC TACGTGAACA TCGATGACGG CTGGTGGCTC AAACGCCGCG CGTCGGACGG GCGCATGCTC ATCCGCGCCG AGCGTTTCCC ATCGGCGGTG ACGGCCGACG GCGGGACCAG CTTTCGCCCC CTGACCGATC GCCTCCACGC GATGGACCTC AAGGCCGGCA TCTATTCCGA CATCGGCCGC AACAGCTGCG GTCAGGTGTT CACCTCCACG TTTCCAAACC AGCCGGAGGG CGACATCCGG GAACGAGAAG TCGGTCTTTA CGGCCACGTC GACCAGGACA TCGCCCTCTA TTTCAAGGAC TGGGGATTCG ATCTGATCAA GGTCGACGGC TGCGGCGTGC GCGGCTTGCC GGCCTCCGAT CCCCGGGTGA AGGCCGGGCT CTATCGCGCG CTGGGTCCGC TGGTTGACGT CGACTCCCTG GGAAGGACGG ACGTTCCGGC CGTCCGGGAT CTCTACAAGG CGGTGGGCGC CGCCCTGGAT CGCTCCAACC CCGACGGCGA CTTCGTCTAT TCCATCTGCC TCTGGGGCGC GGCCGACGTT CGCGCCTGGG GCAAGGATGT CGGCGCGATC TCGCGAACCA GCGAGGACAT TTCGCCGACC TGGAGCCGGA TGCTGCATAA TCTCGACAGC GTCTCCCGGC GCGCCCTCTA TGCGCATCCA GGCTCCTGGA ACGATCCAGA CATGCTCTAC GTGGGCAAGG GCGATTTCGA CGAAGCGCAT CTGGTCGAGG CCCGCTCGCA TTTCGCGCTT TGGGCGATGG TCAACGCGCC GCTGATCATC GGCTACGACT TGCGCACGGC CGCGCCGGCG CTTCTGGACA TCCTGGGCGC CAAGGAGATC ATCGCGCTCA ATCAGGACTC GGCGGGCAAT CAGGCCGTGC TGGCTTTCGA TTCCGCCGAC GTCTCCATCT TCGTCAAGAC ACTGGCCGGT GGCGACAAGG CGGTGGCGAT CCTCAATCGC ACGGCCGCGC CCGCCGAGGC GGTGCTGACC GCGGATCATC TGAAGCTGCT GGGTACGGCC GATGTCGAAC TGACGGACCT GTGGTCAGGC GCGGCCACCC GTTTTCGCGC CGAACGAAAG TTCCAGCTGG CGCCGCGCCA GACGTTGATC TTCCGGGCCA AGGGCGCGCG CAAGCTGGCC GACGGCGTCT TTCTCTCCGA ACAGCCCGGG TCGGTGAATC CCGCCGTCGA CGGCGTTGTG ATCCCCCAAG CCGACCCGCT GATCCATCGT GCGATCCTGC CCTGGCGCGG AACACGCGGC GTCGGCGAAC CCCCGCGCTA CGGGGGCTGG GGCGGGGCTC AAGCCGACCG CACGCCCTAT GACCAGGAAC TGGCGATCGC CGGTCGGCGG TTCGACACTG GCCTCGGCGT CCTGGCCAAT TCGCGCTTCG AAGTGCGCAA CGGCGGCTTT CGCCGCTTCA CCGCCAGCGT CGGCGTCGAC GACTCCGCCG AGGATCGGTC ACGGCCCGTG ACCTTCTTCG TCTATGGCGA CGGCAAGCTT CTGGCGCGCT CGCGGCCGGC GAGCTTCGGC CAGCCGCCGC AGGACCTCAG CGTCGAGGTG 
TCGGGCGTCA AGTTGCTTGA ATTGGTCGCG CGCGTTTCTG GCCAATCGCG CCACCCAGAT TCAGTAACCT GGGGCGACGC GGCGCTGCAT CGTTAG
|
Protein sequence | MTFFRIAAAV AATLVMSATQ AVADPLAPVA RWTAYERAAA RTPPMGWNSW NAFTSDIDEE KIMGSARILV KAGLADRGYR YVNIDDGWWL KRRASDGRML IRAERFPSAV TADGGTSFRP LTDRLHAMDL KAGIYSDIGR NSCGQVFTST FPNQPEGDIR EREVGLYGHV DQDIALYFKD WGFDLIKVDG CGVRGLPASD PRVKAGLYRA LGPLVDVDSL GRTDVPAVRD LYKAVGAALD RSNPDGDFVY SICLWGAADV RAWGKDVGAI SRTSEDISPT WSRMLHNLDS VSRRALYAHP GSWNDPDMLY VGKGDFDEAH LVEARSHFAL WAMVNAPLII GYDLRTAAPA LLDILGAKEI IALNQDSAGN QAVLAFDSAD VSIFVKTLAG GDKAVAILNR TAAPAEAVLT ADHLKLLGTA DVELTDLWSG AATRFRAERK FQLAPRQTLI FRAKGARKLA DGVFLSEQPG SVNPAVDGVV IPQADPLIHR AILPWRGTRG VGEPPRYGGW GGAQADRTPY DQELAIAGRR FDTGLGVLAN SRFEVRNGGF RRFTASVGVD DSAEDRSRPV TFFVYGDGKL LARSRPASFG QPPQDLSVEV SGVKLLELVA RVSGQSRHPD SVTWGDAALH R
|
| |
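The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the Gene Length includes the terminal stop codon (the gene sequence above ends in TAG). The function names `span_length`, `expected_protein_length`, and `gc_fraction` are hypothetical helpers for illustration, not part of any database tool.

```python
# Values copied from the record above.
START_BP = 2633003
END_BP = 2634898
GENE_LENGTH_BP = 1896
PROTEIN_LENGTH_AA = 631


def span_length(start: int, end: int) -> int:
    """Length of a coordinate span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a sequence, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# Both checks agree with the Gene Length and Protein Length fields.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Passing the Gene sequence field (with its spaces and line break) to `gc_fraction` gives a value that can be compared against the rounded GC content field.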