Gene Information |
Locus tag | Caul_4083 |
Symbol | |
ID | 5901545 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 4427107 |
End bp | 4429605 |
Gene Length | 2499 bp |
Protein Length | 832 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641564603 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001685705 |
Protein GI | 167648042 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
| |
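The Start bp, End bp, Gene Length, and Protein Length fields above are related by simple arithmetic. A minimal sketch of that consistency check follows. It assumes 1-based inclusive coordinates and an unspliced CDS that ends in a single stop codon; both are assumptions, and the function names are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded, assuming an unspliced CDS ending in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for Caul_4083.
length = gene_length(4427107, 4429605)
print(length, protein_length(length))  # 2499 832
```

Under these assumptions the computed values agree with the Gene Length (2499 bp) and Protein Length (832 aa) fields.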
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCGAAT TCAGCCGCCG CCAGGCCTTG GCCGCCACCG CCGCGGGCGC CGCCTTGGCC GCCACGTCTC CCACGCGCGC CGCCCCCTCG AAGGGCAAGC CGGCCCCTAT GGTCCCCGCC GTCCCCGCCA TCGACCTGGC CCCGCGCGAG CGCCTGTCGC TGGACTTCGA CTGGCGCTTC AAGCTGGGTC ACGCCCAGGA TCCAGCCCGC GACTTCGGCT TCGGGGCCAA TCAGGGCACG TTCGCCAAGG CCGGCAAGGT CGTCGCCGCG GCCGAACTCG ACTTCGACGC CAGCGCCTGG GCGCCCGTCA CCCTGCCCCA CGACTGGGCC GTCGAGCTGC CCTTCGTCGA CAACCCCGCC TACGTCCCGT CCGGCAAGCC CGACGACGGG GACCCGCGCG CCGCCCACGG CTACAAGCCC CTGGGCCGCG AGTTCCCCGA GACCAGCATC GGCTGGTACC GCAAGACCTT CGCCCTTCCC GCCACCGACG CCGGCAAGCG GTTGTCGATC GAGTTCGACG GCGCCTTCCG CGACGCCTTG GTGATCGTCA ACGGCTACAT CCTCGAGCGC GAGGACAGCG GCTATTCGCC GTTTCGCGTC GACATCACCG ACATCGCCAA TGTCGGCGGC GACAACAGCC TGGTGGTGCG GATCGACGCC AGCCTCGGCG AGGGCTGGTT CTACGAGGGC GCGGGCCTCT ATCGCCACGT CTGGCTGGTC AAGACCGCCA CGGTTCACGT GCCCCAGTGG GGCGTGTTCG TGCGCGCCAA GCTCGACGGG ACCCTGACCA TCGACACCGA CCTGGTCAAC GAAGGCGACG CCCGCATTGA CTATGAGCTG GCCCACGCCG TGTTCGACGG TCAGGGCAAA CCGGTGCTGG CCCCTGCCCC GGCCACGGGC CTGCTGCCGG CCTGGGAGCG GCAGTCGCTG TCCCTGACCG CCCAGCTTCC GAACCCGGTC CCGTGGTCGC TGGAGACCCC GCATCTCTAC ACCCTGGCCA CCGAGGTCAG GGTCGGCGGC GCCGTGGTCG ACCGCTTCGT CACCCGGTTC GGCGTGCGGT CGATCGCCTT CGATCCCGAC AAGGGCTTCC TGCTGAACGG TCAGTCCGTG AAGCTGAAGG GAACCTGCAA TCACCAGGAC CACGCCGGGG TCGGCGCGGC CATTCCCGAC GCCCTGCAGG TCTGGCGGCT GGAGCAGCTC AAGTCGATGG GCTGCAACGC CTATCGCACC GCGCACAACC CGCCGACGCC CGAGTTGCTC GACGCCTGTG ACCGCCTGGG CATGGTGGTG ATCGACGAGA CCCGCCGAAT GTCCAGCGAT CCAACCTCGC TGGAGGAGTT GGAGCGTCTG GTCCGCCGCG ACCGCAACCA CCCGTCGGTG ATCCTCTGGT CGATCGGCAA CGAGGAGCCG CAGCAGGGCA CGGCCCGCGG CGCCAAGGTG GCGACCACGA TGAAGCGGCT GGTCAATCGC CTGGACCCAA CCCGCCTGGT CACCGCCGCC ATGGATCAGG GCTTTGGCGA GGGCATCAGC CCGGTCCTCG ACGTCCAGGG CTTCAACTAT CGCCACGAGA AGATGGACGA CTTCCACGCG CGCTTCCCGC ACGTGCCGAT CATCGGCACC GAGAGCGCCA GCACCGTGGC CACCCGCGGG GAATACGCCC GCGACGACGC CAAGAGCTAC GTCCCGGCCT ACGACACCGA GCACCCCTGG TGGGCGACCA CCGCCGAGAC GTGGTGGAGC CACGCGGCCG ACCGGCCGTG GGTGGCCGGC GGCTTCATCT GGACCGGCTT CGACTATCGC 
GGCGAGCCCA CCCCGTTCAA CCGCTGGCCC AGCATCAGCT CGCACTTCGG CGCGCTCGAC ACCTGCGGCT TCCCCAAGGA CAACTATTAC TACTACCGCG CCTGGTGGCG GCCCGAGCCG CTGTTGCACC TGTTGCCGCA CTGGAACTGG GAGGGCCGCG AGGGCCAACC CATCGCGGTC TGGGCGCACA GCAACTGCGA CAAGGTCGAG CTGTTCCTGA ACGGCAAGAG CCAGGGCGTT CGCCTTGTCA CCCCCAACAA CCACGTCGAA TGGTCCGTGC CCTATGCGCC CGGCGTGATC GAGGCTCACG GCTACAAGGG CGGCAAGCTC ATCCTGCGCG AGCGCCGCGA GACCGCCGGT CCCGCCGCCG CCCTGCGCCT CACCGTCGAC CGCTCGCGCC TGGCCGCCGA CGGCCAGGAT GTGGCGATCC TCAAGGTCGA GGTGCTGGAC GCCAAGGGCC GGCCCGCGCC CCGCGCCGAC GACCTGGTCT CGTTCACGCT CAGCGGTCCG GGCCAGGTGA TCGGAGTGGG CAACGGCAAT CCCACCAGCC ACGAGGCGGA CGTCGCCAGC CAGCGCAAGG CGTTCAACGG CCTGGTCCAG GCGATCGTCC GCACACGGCG TGGGCAGGCG GGGGAGCTGC GGGTGACGGC GTCGGCGGCG GGGCTGAAGC CAAGCACGAT GAGCGTGACG GTGGGATGA
|
Protein sequence | MVEFSRRQAL AATAAGAALA ATSPTRAAPS KGKPAPMVPA VPAIDLAPRE RLSLDFDWRF KLGHAQDPAR DFGFGANQGT FAKAGKVVAA AELDFDASAW APVTLPHDWA VELPFVDNPA YVPSGKPDDG DPRAAHGYKP LGREFPETSI GWYRKTFALP ATDAGKRLSI EFDGAFRDAL VIVNGYILER EDSGYSPFRV DITDIANVGG DNSLVVRIDA SLGEGWFYEG AGLYRHVWLV KTATVHVPQW GVFVRAKLDG TLTIDTDLVN EGDARIDYEL AHAVFDGQGK PVLAPAPATG LLPAWERQSL SLTAQLPNPV PWSLETPHLY TLATEVRVGG AVVDRFVTRF GVRSIAFDPD KGFLLNGQSV KLKGTCNHQD HAGVGAAIPD ALQVWRLEQL KSMGCNAYRT AHNPPTPELL DACDRLGMVV IDETRRMSSD PTSLEELERL VRRDRNHPSV ILWSIGNEEP QQGTARGAKV ATTMKRLVNR LDPTRLVTAA MDQGFGEGIS PVLDVQGFNY RHEKMDDFHA RFPHVPIIGT ESASTVATRG EYARDDAKSY VPAYDTEHPW WATTAETWWS HAADRPWVAG GFIWTGFDYR GEPTPFNRWP SISSHFGALD TCGFPKDNYY YYRAWWRPEP LLHLLPHWNW EGREGQPIAV WAHSNCDKVE LFLNGKSQGV RLVTPNNHVE WSVPYAPGVI EAHGYKGGKL ILRERRETAG PAAALRLTVD RSRLAADGQD VAILKVEVLD AKGRPAPRAD DLVSFTLSGP GQVIGVGNGN PTSHEADVAS QRKAFNGLVQ AIVRTRRGQA GELRVTASAA GLKPSTMSVT VG
|
| |
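The sequences above are printed in space-separated 10-character blocks, so they need to be joined before any analysis. The sketch below shows one way to do that and to compute a GC fraction comparable to the GC content field. The helper names are hypothetical, and only the first three blocks of the gene sequence are used, for illustration; the result for this short prefix is not expected to match the 70% reported for the whole gene.

```python
def clean_sequence(blocked: str) -> str:
    """Join whitespace-separated blocks into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a DNA string."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First three 10-base blocks of the gene sequence, for illustration only.
prefix = clean_sequence("ATGGTCGAAT TCAGCCGCCG CCAGGCCTTG")
print(len(prefix), round(gc_fraction(prefix), 2))  # 30 0.63
```

Passing the full gene sequence through `clean_sequence` first would also remove the line break that splits it in this record.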