Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_3397 |
Symbol | |
ID | 5900852 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 3667966 |
End bp | 3669414 |
Gene Length | 1449 bp |
Protein Length | 482 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641563903 |
Product | beta-galactosidase |
Protein accession | YP_001685022 |
Protein GI | 167647359 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase |
TIGRFAM ID | [TIGR03356] beta-galactosidase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0879388 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGGCGATA AAGGCATTAG CCGTCGGGCG CTGGGGGCGC TCGCGGCGGG CGGCGTGGCG AGCGTCGGGC TGAGCGGCTG CGATCGGGTC GGCGCGACCG AGGCGACGGT CAAGTCACGG CAATTTCCGG CGGATTTCGT CTGGGGCGTG GCCACGGCGG CCTTCCAGAC CGAGGGCTCG CCGACCGCCG ACGGGCGCGG ACCCAGCATC TGGGACACCT TCCAGAACCA GCCGGGCCGC ATCAAGGACG GCTCCACCGC CGACGTCGCC ACCGACAGCT ATCGCCGCTA CGCCGAGGAT GTCGATCTGA TCGCCGGGGC GGGATTGAAG GCCTTCCGCT TCTCGATCGC CTGGTCGCGG GTGCTGCCGA CCGGGGAGGG CACGGTCAAC GCGGCGGGGC TGGACCACTA TGACCGCCTG GTCGACGCCT GCCTGGCCAA GGGGATCACC CCCTACGCCA CGCTGTTTCA CTGGGACCTG CCCCAGGCCC TGCAGGACAA GGGCGGCTGG AGCGCGCGCG ACACCGCCAG CAGCTTTGGC GACTACGCCG CCGCCGTGGC GGCGCGGCTG GGCGACCGGC TCAAGCACGT CATCACCCTG AACGAACCGG CCGTGCACAC GGTGTTCGGC CACGTGCTGG GCGAGCATGC CCCGGGGCTG AAGGACATCG CCCTGCTGGG GCCGACCACC CACCACATGA ACCTCGGACA GGGGCTGGCG ATCCAGGCCC TGCGCGCGGC GCGCGGCGAC CTGCGGATCG GCACGACCCA GGCCTTGCAG CCCTGCCGGG CGTCGGGCGG GCCGCTGGCG TTCTGGAACC GTCCGGCGGC GGACGGGCTG GACGCCCTGT GGAACCGCGC CTGGCTGGAT CCGCTGCTGA AGGGGACCTA TCCGGCCCTG ATGGACGACT TCCTCAAGGG CCATGTCCGC GACGGCGACC TGAAGACCAT CCGCCAGCCG ATCGACTTCC TGGGGGTCAA TTACTATGCG CCGGCCTATG TGAAGCTGGA CCTCGGCAAC GCCAGCCACA TCGCGCCGGG CTCGCCACCG AGGGGCGCGG AGCTGGACGC CTTCGGCCGC CAGATCGATC CTTCGGGCCT GGTCCAGGTG CTCGAGATGG TGCGCCGCGA CTACGGCAAT CCGCCGGTGC TGATCACCGA GAACGGCTGC TCGGACCCGT TCGGACCTGG TCCGGGCGTG ATCGACGACG GCTTCCGCGG CCAATACCTG CGCCGGCACC TGGAGGCGGT GAAGAGCGCG ACGGAGGCCG GTTCGCGGAT CGGCGGCTAT TTCACCTGGA CCCTGGTCGA CAACTGGGAG TGGGACCTGG GCTACACGTC AAAGTTCGGC CTGGTGTCGC TGGACCGCGC GACGGGCGCG CGGACACCCA AGGCGTCGTA TGGCTGGTTC AAGGGCGTGG CGGAGAGCGG GCTGCTCCCC GCCGCCTGA
|
Protein sequence | MGDKGISRRA LGALAAGGVA SVGLSGCDRV GATEATVKSR QFPADFVWGV ATAAFQTEGS PTADGRGPSI WDTFQNQPGR IKDGSTADVA TDSYRRYAED VDLIAGAGLK AFRFSIAWSR VLPTGEGTVN AAGLDHYDRL VDACLAKGIT PYATLFHWDL PQALQDKGGW SARDTASSFG DYAAAVAARL GDRLKHVITL NEPAVHTVFG HVLGEHAPGL KDIALLGPTT HHMNLGQGLA IQALRAARGD LRIGTTQALQ PCRASGGPLA FWNRPAADGL DALWNRAWLD PLLKGTYPAL MDDFLKGHVR DGDLKTIRQP IDFLGVNYYA PAYVKLDLGN ASHIAPGSPP RGAELDAFGR QIDPSGLVQV LEMVRRDYGN PPVLITENGC SDPFGPGPGV IDDGFRGQYL RRHLEAVKSA TEAGSRIGGY FTWTLVDNWE WDLGYTSKFG LVSLDRATGA RTPKASYGWF KGVAESGLLP AA
|
| |