Gene Caul_2133 details

Gene Information

Locus tag: Caul_2133
Symbol:
ID: 5899588
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 2301271
End bp: 2304705
Gene Length: 3435 bp
Protein Length: 1144 aa
Translation table: 11
GC content: 69%
IMG OID: 641562622
Product: glycoside hydrolase family protein
Protein accession: YP_001683759
Protein GI: 167646096
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3250] Beta-galactosidase/beta-glucuronidase
TIGRFAM ID:
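
The coordinate and length fields above are related by simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the unspliced CDS ends with a single stop codon. The record does not state these conventions, but its numbers are consistent with them. The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Number of codons in an unspliced CDS, minus one for the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information fields of this record.
length = gene_length_bp(2301271, 2304705)
print(length, expected_protein_length(length))  # prints "3435 1144"
```

Both results match the Gene Length (3435 bp) and Protein Length (1144 aa) fields listed above.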


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.283843
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.156102
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCACGC TGCGCCAATG CCTTTTGAAG ACCGCTCTGC TGGCCGCGAC CGCCCTGACG 
CTGGCCGCGC CCGTCGCCGT AGCCGCCCAG GTCGCCTGGG GTCCTTACAA CGCCGACTTC
CCCGCCGGTG GCGACGGTCT TTCCCGACCT CTGGCCGGCA AGGTTCAAGG CGACGTCCTG
CAGGCGGGCG GCTCTTGGTC GATCCATGGC TGGGTGCTGG CTTCCGAGGC GGCGTCCGGA
CCCAGCCTGG TGGCGGGTCT GGGCGACCCC GCGGCAGGCG GTCGGTTCCT GACGATCGAC
GGCGGGGCGT TCGGCGTCTG GACCGGCGGC CAGCCCTTGA ACGTCAAGGC CGCGATCAAG
CCCGGCGATT GGCGGTTCGT CGCCGCCGTG TCGGACGGCG CCAAGGTCAC GCTCTATCTC
GACGGCGCGC CGGTCGGCGA GGCGCCCGCG GCGATCGCGG CGACCCCGGC CGTGATCTCC
CTGGCGCCGC GCAAGGTCGC CGGCTTTGCG CCCTTTGCCG GTCGCCTCGC CGATGTCTCG
GCCGAGGACC GCGTCCTGTC AGCCGCCGAG ATCAAGACCC TGGCGGCGCG CAGGCCTGAT
CCGCTGACCG TCTATGAGTC CGGCAGTCCT GTCTGGCCCG TGCAGGTGCG CCAGATGTAC
GGTCAGGTCG CGCCGCAGGA CGCCTGGACC CGGCCCAAGA GCAAGGGCGC GATCTCGGCT
CCGGTGGCCA AGCCTGCCTA TGCCGGTCCC GCGCTGGTCG CCGACGGCGC CGGAACCTGG
ACCCTCAAGC GCTGGGCGCT GGTCGAGGCG CCCAAGGTTT CGGAGGGCGG TGCGGCGGTG
TCGGCGCCGG CCTATGACGC CAAGGGTTGG TACGCCGCCA CCGTGCCCGG CACGGTCCTG
ACCACCCTGG TCGACCGCGG CGTCTATCCC GATCCCGACT ACGGCCTTAA CAACACGGCC
ATTCCCGAGA GCCTGAACAA GCAGGATTAT TGGTACCGCA GCGCGTTCGA GGCCCCGGCC
GACGCCGCCG GCAAGCATCT GCTGCTGACC TTCAAGGGGA TCAACTACGC CGCCGAGATC
TGGCTCAACG GCGAGAAGCT GGGCGACCTG AAGGGCGCTT TCATCCGTGG ACGGTTCGAC
CTCACCGGCA AGTTGAAGCC AGGACAGAAC GCCATCGCGG TCAAGGTCTC GCCGCCGCCG
CATCCGGGGA TCGCCCATGA GGAGTCCCTG AGCGCTGGGG TTGGCGAGAA CGGCGGCATG
ATGGCCCTGG ACGGCCCGAC CTTCATCGCC AGCGAGGGCT GGGACTGGAT CCCTAGCGTT
CGCGACCGCA ACACCGGCCT GTGGCAGGAC GTGGTGTTGA CCGTCAGCGG CGAGGCGCGG
CTGGGCGACG CCCATGTCGT CACCACGCTG CCGAAGGCCG ACAACAGCGT CGCCGACATC
GAGATCACCG CCCCGATCGG GAACCTGACT GGCCATCCGG TGCAGGCCGT CGTCACGGCC
GCCTTCGATG GGGTCAGCGT GTCCAAGACC GTCACCCTGG CCCCCGGCGC CGGATCGGTG
GTTCTGAGTC CGGCGGAGTT CCCGCAGCTG TCGGTGAAGA ACCCCAAGCT CTGGTGGCCC
AACGGGTACG GCGATCCGGC CCTGCATGAC CTGAAGCTGT CGGTGGCCAT CGACGGCCAG
GTTTCCGACG ACAAGACGAT CCGGTTCGGC ATCCGCCAGA TCACCTATGA CCTGTCGCTG
ATGAACCCCT CTGGCCATCT GCGCCGGGTC GAGATCGACT TCTCCAAGGC CCGCCAACTG
GGCCAGGACG TTACCGACGG CAGCCACGAG GGCGTCCGCA AGGTGCCGGA CGGCTGGGCC
ACCTCCCTGA CCGCGCAGGG CGACAGCTCC GTGGCCGTAC GCGACATCCC CGACACCGGC
CTGACGCCCT ACCTCGTGAT CAAGGTCAAT GGCGTGAAGA TCGCGGCGCG GGGTGGCAAC
TGGGGCACGG ACGATTGGCG CAAGCGCGTC GACCGCGCTC GCCTGGAGCC CTATTTCCGT
CTGCACCATG ACGCCCACCT CAACACCATC CGCAACTGGG TGGGGCAGAA CACCGAAGAC
GTGTTCTACG ACCTGGCCGA CGAGTACGGC CTGCTGGTGC TCAACGACTT CTGGGCTTCG
ACCCAGGACT ACCAGCTGGA GCCGCAGGAC GTGCCGCTGT TCCTGGCCAA CGCCGCCGAT
GTGATCGGCC GCTACCGCAA CCACCCTTCG ATCGCCCTGT GGTTCGGCCG CAACGAAGGC
ATCCCGCAGC CGATTCTCAA CGAGGGCCTG GAAGCGCTGG TCCACGATCT GGACGGCACG
CGCTGGTACA CCGGCAGCTC CAACCGGGTG AACCTGCAAA ACAGCGGACC CTACAGCTAC
AAGGAGCCGC AGACCTATTT CGCCGACCAC GCCAAGGGCT TCTCGGTTGA GGTCGGCACG
CCGTCGTTCC CGACCCTGGA GGCCTTCGAA GCGGCCGTGC CGCAGCCCGA TCGCTGGCCG
ATCAGCGACG CCTGGGCCTA TCACGACTGG CACCCGACCG GGAACGGCGC GACCAAGTCG
TTCCTCGACG CCATGACCGC CAAGCTGGGT CCGCCGACCA GCCTGGAGGA CTTCGAGCGC
AAGGCCCAGC TGATGAACTA CGAGACGCAC AGGGCGATCT TCGAGGGCAT GAACGCCGAG
CTCTGGACCA GGTCGTCGGG CCGCCTGCTG TGGATGACCC AGCCCGCATG GCCCTCGACC
ATGTGGCAGA TCCTCAGCCA CGACTACGAC ACCCACGCCG CGTTCTACGG GACACAGAAG
GCCGCCGAGA TCGTCCACGT CCAGATGAGC CTGCCCGACC ACCGGCTGGA GCTGGTCAAT
AACGGCCTGA CGCCGATCGC CGGCGCCAGC CTGCGGGCCC GGGTCGTGGG ACTGGACGGC
AAGGCCCTGG CCGAACGGAC CTGGTGGATC GACGCGGCCG CCAACAGCAC GACCCAGGGC
GAGGTGCTGG ACCTGACCGA ACCCCTGGCC GCCCAGGGCG CGGTGGTGGT GCGGCTGGAC
CTGGCCGCCG CCGACGGGAC GCCGATGTCG AGCAACCTCT ACTGGCTGGC CCGCGACGCC
GAGGCCAGCC GCAAGCTGTC GGCCATGGCG GCCCAGCCGG TGACGATCAG CGCCAAGTCC
GCCAAGGCGG ACGCGGAGAC GGTGGTGACG GTGTCTCTGG CCAATACCGG CGCGGCACCG
GCCCTGAACG GCAAGCTGAC CCTGGTCGAC GCCAAGGGCG CGCGCATCCT GCCGGCCTAC
TACGCCGACA ACTATGTCTC GCTGCTGCCC GGCGAGCGGC GGACGGTGGA GGTTCGCTAT
CCGGGCGCGG TGACGGGCGC CAAGGTCGAG CTGCGCGGCT GGAACGTGAC CCCGGCCGTC
GCGGTGGTGC GCTGA
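
The sequence is displayed in space-separated blocks of 10 bases, so it needs to be cleaned before it can be analysed. The following is a minimal sketch of that step, with a GC percentage calculation to compare against the GC content field. The helper names are hypothetical, and the sample string is a made-up toy input, not part of this record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Toy input in the same block layout as the record (not real data).
sample = "ATGC GGCC\n"
seq = clean_sequence(sample)
print(len(seq), gc_percent(seq))  # prints "8 75.0"
```

Applied to the gene sequence above, the cleaned length should equal the listed Gene Length of 3435 bp, and the GC percentage can be compared with the listed value of 69%.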
 
Protein sequence
MTTLRQCLLK TALLAATALT LAAPVAVAAQ VAWGPYNADF PAGGDGLSRP LAGKVQGDVL 
QAGGSWSIHG WVLASEAASG PSLVAGLGDP AAGGRFLTID GGAFGVWTGG QPLNVKAAIK
PGDWRFVAAV SDGAKVTLYL DGAPVGEAPA AIAATPAVIS LAPRKVAGFA PFAGRLADVS
AEDRVLSAAE IKTLAARRPD PLTVYESGSP VWPVQVRQMY GQVAPQDAWT RPKSKGAISA
PVAKPAYAGP ALVADGAGTW TLKRWALVEA PKVSEGGAAV SAPAYDAKGW YAATVPGTVL
TTLVDRGVYP DPDYGLNNTA IPESLNKQDY WYRSAFEAPA DAAGKHLLLT FKGINYAAEI
WLNGEKLGDL KGAFIRGRFD LTGKLKPGQN AIAVKVSPPP HPGIAHEESL SAGVGENGGM
MALDGPTFIA SEGWDWIPSV RDRNTGLWQD VVLTVSGEAR LGDAHVVTTL PKADNSVADI
EITAPIGNLT GHPVQAVVTA AFDGVSVSKT VTLAPGAGSV VLSPAEFPQL SVKNPKLWWP
NGYGDPALHD LKLSVAIDGQ VSDDKTIRFG IRQITYDLSL MNPSGHLRRV EIDFSKARQL
GQDVTDGSHE GVRKVPDGWA TSLTAQGDSS VAVRDIPDTG LTPYLVIKVN GVKIAARGGN
WGTDDWRKRV DRARLEPYFR LHHDAHLNTI RNWVGQNTED VFYDLADEYG LLVLNDFWAS
TQDYQLEPQD VPLFLANAAD VIGRYRNHPS IALWFGRNEG IPQPILNEGL EALVHDLDGT
RWYTGSSNRV NLQNSGPYSY KEPQTYFADH AKGFSVEVGT PSFPTLEAFE AAVPQPDRWP
ISDAWAYHDW HPTGNGATKS FLDAMTAKLG PPTSLEDFER KAQLMNYETH RAIFEGMNAE
LWTRSSGRLL WMTQPAWPST MWQILSHDYD THAAFYGTQK AAEIVHVQMS LPDHRLELVN
NGLTPIAGAS LRARVVGLDG KALAERTWWI DAAANSTTQG EVLDLTEPLA AQGAVVVRLD
LAAADGTPMS SNLYWLARDA EASRKLSAMA AQPVTISAKS AKADAETVVT VSLANTGAAP
ALNGKLTLVD AKGARILPAY YADNYVSLLP GERRTVEVRY PGAVTGAKVE LRGWNVTPAV
AVVR