Gene Caul_4083 details


Gene Information

Locus tag: Caul_4083
Symbol:
ID: 5901545
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 4427107
End bp: 4429605
Gene Length: 2499 bp
Protein Length: 832 aa
Translation table: 11
GC content: 70%
IMG OID: 641564603
Product: glycoside hydrolase family protein
Protein accession: YP_001685705
Protein GI: 167648042
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3250] Beta-galactosidase/beta-glucuronidase
TIGRFAM ID:
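
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes the start and end coordinates are 1-based and inclusive, and that the last codon is a stop codon not counted in the protein length. The record does not state either convention, and the variable names are hypothetical.

```python
# Values copied from the record above.
start_bp = 4427107
end_bp = 4429605

# Assumption: coordinates are 1-based and inclusive on both ends,
# so the length is the difference plus one.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is excluded
# from the protein length.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)       # 2499, matches "Gene Length"
print(gene_length % 3)   # 0, the gene is a whole number of codons
print(protein_length)    # 832, matches "Protein Length"
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.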


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTCGAAT TCAGCCGCCG CCAGGCCTTG GCCGCCACCG CCGCGGGCGC CGCCTTGGCC 
GCCACGTCTC CCACGCGCGC CGCCCCCTCG AAGGGCAAGC CGGCCCCTAT GGTCCCCGCC
GTCCCCGCCA TCGACCTGGC CCCGCGCGAG CGCCTGTCGC TGGACTTCGA CTGGCGCTTC
AAGCTGGGTC ACGCCCAGGA TCCAGCCCGC GACTTCGGCT TCGGGGCCAA TCAGGGCACG
TTCGCCAAGG CCGGCAAGGT CGTCGCCGCG GCCGAACTCG ACTTCGACGC CAGCGCCTGG
GCGCCCGTCA CCCTGCCCCA CGACTGGGCC GTCGAGCTGC CCTTCGTCGA CAACCCCGCC
TACGTCCCGT CCGGCAAGCC CGACGACGGG GACCCGCGCG CCGCCCACGG CTACAAGCCC
CTGGGCCGCG AGTTCCCCGA GACCAGCATC GGCTGGTACC GCAAGACCTT CGCCCTTCCC
GCCACCGACG CCGGCAAGCG GTTGTCGATC GAGTTCGACG GCGCCTTCCG CGACGCCTTG
GTGATCGTCA ACGGCTACAT CCTCGAGCGC GAGGACAGCG GCTATTCGCC GTTTCGCGTC
GACATCACCG ACATCGCCAA TGTCGGCGGC GACAACAGCC TGGTGGTGCG GATCGACGCC
AGCCTCGGCG AGGGCTGGTT CTACGAGGGC GCGGGCCTCT ATCGCCACGT CTGGCTGGTC
AAGACCGCCA CGGTTCACGT GCCCCAGTGG GGCGTGTTCG TGCGCGCCAA GCTCGACGGG
ACCCTGACCA TCGACACCGA CCTGGTCAAC GAAGGCGACG CCCGCATTGA CTATGAGCTG
GCCCACGCCG TGTTCGACGG TCAGGGCAAA CCGGTGCTGG CCCCTGCCCC GGCCACGGGC
CTGCTGCCGG CCTGGGAGCG GCAGTCGCTG TCCCTGACCG CCCAGCTTCC GAACCCGGTC
CCGTGGTCGC TGGAGACCCC GCATCTCTAC ACCCTGGCCA CCGAGGTCAG GGTCGGCGGC
GCCGTGGTCG ACCGCTTCGT CACCCGGTTC GGCGTGCGGT CGATCGCCTT CGATCCCGAC
AAGGGCTTCC TGCTGAACGG TCAGTCCGTG AAGCTGAAGG GAACCTGCAA TCACCAGGAC
CACGCCGGGG TCGGCGCGGC CATTCCCGAC GCCCTGCAGG TCTGGCGGCT GGAGCAGCTC
AAGTCGATGG GCTGCAACGC CTATCGCACC GCGCACAACC CGCCGACGCC CGAGTTGCTC
GACGCCTGTG ACCGCCTGGG CATGGTGGTG ATCGACGAGA CCCGCCGAAT GTCCAGCGAT
CCAACCTCGC TGGAGGAGTT GGAGCGTCTG GTCCGCCGCG ACCGCAACCA CCCGTCGGTG
ATCCTCTGGT CGATCGGCAA CGAGGAGCCG CAGCAGGGCA CGGCCCGCGG CGCCAAGGTG
GCGACCACGA TGAAGCGGCT GGTCAATCGC CTGGACCCAA CCCGCCTGGT CACCGCCGCC
ATGGATCAGG GCTTTGGCGA GGGCATCAGC CCGGTCCTCG ACGTCCAGGG CTTCAACTAT
CGCCACGAGA AGATGGACGA CTTCCACGCG CGCTTCCCGC ACGTGCCGAT CATCGGCACC
GAGAGCGCCA GCACCGTGGC CACCCGCGGG GAATACGCCC GCGACGACGC CAAGAGCTAC
GTCCCGGCCT ACGACACCGA GCACCCCTGG TGGGCGACCA CCGCCGAGAC GTGGTGGAGC
CACGCGGCCG ACCGGCCGTG GGTGGCCGGC GGCTTCATCT GGACCGGCTT CGACTATCGC
GGCGAGCCCA CCCCGTTCAA CCGCTGGCCC AGCATCAGCT CGCACTTCGG CGCGCTCGAC
ACCTGCGGCT TCCCCAAGGA CAACTATTAC TACTACCGCG CCTGGTGGCG GCCCGAGCCG
CTGTTGCACC TGTTGCCGCA CTGGAACTGG GAGGGCCGCG AGGGCCAACC CATCGCGGTC
TGGGCGCACA GCAACTGCGA CAAGGTCGAG CTGTTCCTGA ACGGCAAGAG CCAGGGCGTT
CGCCTTGTCA CCCCCAACAA CCACGTCGAA TGGTCCGTGC CCTATGCGCC CGGCGTGATC
GAGGCTCACG GCTACAAGGG CGGCAAGCTC ATCCTGCGCG AGCGCCGCGA GACCGCCGGT
CCCGCCGCCG CCCTGCGCCT CACCGTCGAC CGCTCGCGCC TGGCCGCCGA CGGCCAGGAT
GTGGCGATCC TCAAGGTCGA GGTGCTGGAC GCCAAGGGCC GGCCCGCGCC CCGCGCCGAC
GACCTGGTCT CGTTCACGCT CAGCGGTCCG GGCCAGGTGA TCGGAGTGGG CAACGGCAAT
CCCACCAGCC ACGAGGCGGA CGTCGCCAGC CAGCGCAAGG CGTTCAACGG CCTGGTCCAG
GCGATCGTCC GCACACGGCG TGGGCAGGCG GGGGAGCTGC GGGTGACGGC GTCGGCGGCG
GGGCTGAAGC CAAGCACGAT GAGCGTGACG GTGGGATGA
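
The GC content field can be recomputed from a sequence printed in space-separated blocks of ten bases, as above. The following is a minimal sketch, not the database's own method. The helper names `clean` and `gc_fraction` are hypothetical. Only the first line of the gene sequence is used here for brevity, so the result describes that 60 bp fragment rather than the whole gene.

```python
def clean(seq: str) -> str:
    """Remove all whitespace from a sequence block and upper-case it."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


# First line of the gene sequence, copied from the record above.
first_line = "ATGGTCGAAT TCAGCCGCCG CCAGGCCTTG GCCGCCACCG CCGCGGGCGC CGCCTTGGCC"

print(len(clean(first_line)))            # number of bases in the fragment
print(round(gc_fraction(first_line), 3)) # GC fraction of the fragment only
```

Passing the full gene sequence to `gc_fraction` would give the figure to compare against the GC content field.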
 
Protein sequence
MVEFSRRQAL AATAAGAALA ATSPTRAAPS KGKPAPMVPA VPAIDLAPRE RLSLDFDWRF 
KLGHAQDPAR DFGFGANQGT FAKAGKVVAA AELDFDASAW APVTLPHDWA VELPFVDNPA
YVPSGKPDDG DPRAAHGYKP LGREFPETSI GWYRKTFALP ATDAGKRLSI EFDGAFRDAL
VIVNGYILER EDSGYSPFRV DITDIANVGG DNSLVVRIDA SLGEGWFYEG AGLYRHVWLV
KTATVHVPQW GVFVRAKLDG TLTIDTDLVN EGDARIDYEL AHAVFDGQGK PVLAPAPATG
LLPAWERQSL SLTAQLPNPV PWSLETPHLY TLATEVRVGG AVVDRFVTRF GVRSIAFDPD
KGFLLNGQSV KLKGTCNHQD HAGVGAAIPD ALQVWRLEQL KSMGCNAYRT AHNPPTPELL
DACDRLGMVV IDETRRMSSD PTSLEELERL VRRDRNHPSV ILWSIGNEEP QQGTARGAKV
ATTMKRLVNR LDPTRLVTAA MDQGFGEGIS PVLDVQGFNY RHEKMDDFHA RFPHVPIIGT
ESASTVATRG EYARDDAKSY VPAYDTEHPW WATTAETWWS HAADRPWVAG GFIWTGFDYR
GEPTPFNRWP SISSHFGALD TCGFPKDNYY YYRAWWRPEP LLHLLPHWNW EGREGQPIAV
WAHSNCDKVE LFLNGKSQGV RLVTPNNHVE WSVPYAPGVI EAHGYKGGKL ILRERRETAG
PAAALRLTVD RSRLAADGQD VAILKVEVLD AKGRPAPRAD DLVSFTLSGP GQVIGVGNGN
PTSHEADVAS QRKAFNGLVQ AIVRTRRGQA GELRVTASAA GLKPSTMSVT VG
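
The protein sequence can be checked against the gene sequence by translating codons. The sketch below is a hedged illustration. It builds the standard codon table, whose amino acid assignments translation table 11 shares (table 11 differs in its permitted start codons). The function name `translate` is hypothetical. Only the first and last lines of the gene sequence are translated. Both are in frame, because every full line holds 60 bases.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate an in-frame nucleotide sequence; '*' marks a stop codon."""
    s = "".join(seq.split()).upper()
    if len(s) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[s[i:i + 3]] for i in range(0, len(s), 3))


# First and last lines of the gene sequence, copied from the record above.
first_line = "ATGGTCGAAT TCAGCCGCCG CCAGGCCTTG GCCGCCACCG CCGCGGGCGC CGCCTTGGCC"
last_line = "GGGCTGAAGC CAAGCACGAT GAGCGTGACG GTGGGATGA"

print(translate(first_line))  # MVEFSRRQALAATAAGAALA
print(translate(last_line))   # GLKPSTMSVTVG*
```

The two outputs match the first 20 and last 12 residues of the protein sequence listed above, followed by the stop codon.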