Gene Caul_1303 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1303 
Symbol 
ID5898758 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1376672 
End bp1379152 
Gene Length2481 bp 
Protein Length826 aa 
Translation table11 
GC content69% 
IMG OID641561788 
Productglycoside hydrolase family 3 protein 
Protein accessionYP_001682931 
Protein GI167645268 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.75334 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.19336 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTGGGGAG TTATGAACAC CAAGGCCTTC TGCGCCGCGT TGCTGGCGAC CACCCTCCTG 
TCGACCCCTT TCTTGAATGG CGCGGCGCTG GCCGCCGACA CCAAGAGCAC GGCCCATCCC
GCCCTGTGGC CCGCGGCCAA GAGCCAGGGC GTGGTCGATT CCCAGACCGA AGCCTTCGTC
GATTCCCTGC TGGCCAAGCT GACCCTCGAG GAAAAGGTCG GCCAGATGAT CCAGGGCGAC
ATCGGCTCGG TGAAGCCCGA AGACCTGAAG ACCTACCCCT TAGGCTCGAT CCTGGCCGGC
GGCAGCTCGC CGCCGCTGGG CGCGCCCGAC CGCTCGCCGA TCGGCCCGTG GGTCAAGTCG
GTCGAGGCGT TCCGCGCCGC GGCCGCGCAA CGCCAGGGCG GCACGCGGAT TCCGCTGATG
TTCGGCATCG ACTCCGTGCA CGGCCACGGC AACGCCGTGG GCGCGACGCT CTTCCCGCAC
AACATCGGGC TGGGCGCGGC GCGCGACCCC GAACTGATCC GCAAGATCGG CGCGGCCACC
GCCCAGGAAA CCGCCGCCAG CGGCTTCGAC TGGGCGTTCG GTCCCACCCT GACCGTGCCG
CGCGACGACC GCTGGGGTCG GACCTACGAG GGCTATTCGG AAGACCCCGA GATCGTCCGG
TCCTACGCCG GCCAGATGAT CCTGGGGCTG CAGGGCGCCG TCAGTCAGGG CGGCGTCATC
CAGCAGGGCC ACGTGGCGGC CAGCGCCAAG CATTTCCTGG GCGACGGCGG CACCCATGAC
GGCAAGGACC AGGGTGACAC CCAGGTCTCG GAAGCCGACC TGATCCGCCT GCACGCCCAG
GGCTATGTTC CGGCCGTCAA CGCCGGGACC CTGACCATCA TGGCGTCGTT CAACAGCTGG
AACGGCGAGA AGATGCACGG CAACAAGAGC CTGCTGACCG ACGTGCTGAA GGGCAAGATG
GGCTTCGACG GCTTCATCGT CGGCGACTGG AACGGCCACG GCCAGGTGGC CGGCTGTACG
CCCACCAACT GCGCCCAGGC CGCCAATGCG GGCCTGGACA TGTACATGGC CCCCGACAGC
TGGAAAGAGC TCTACGCCAA CACCCTGGCC CAGGCCAAGT CGGGCGAGAT CCCGATGGCC
CGCATCGACG ACGCCGTGCG CCGCATCCTG CGCGTGAAGG CCAAACTCGG CCTGTTCCAG
CAGGCGCGGC CGCTTGAGGG CAAGGAAGCG GTCATGGCCT CGGCCGACCA CCGCGCCATC
GCCCGCCAGG CGGTGCGCGA GTCGCTGGTG CTGCTGAAGA ACAACGGCGT GCTGCCGGTC
AAGGCCTCGG CCAACATCCT GGTCGCCGGC TCGGGCGCCG ATGACATCGG CCAGCAGGCC
GGCGGCTGGA CCCTGTCGTG GCAGGGCACC GGCAACACCA AGGCCGACTT CCCCAACGCC
CAGTCGATCT ATTCGGGCCT GAAGGAGACG GTCGAGGCTT CCGGCGGGAC GGCGACGCTC
AGCGTTGACG GAGCGTTCGA CAAGAAGCCC GACGTCGCCA TAGTGGTGTT CGGCGAGACG
CCCTACGCCG AGGGCGTGGG CGACATCAGG ACGCTGGAAT TCCAGCCGGG GACCAAGACC
GACCTCGCCC TGCTCAAGAC ACTGAAGGCG GCTGGCGTGC CCGTGGTGTC GGTGTTCCTC
AGCGGCCGGC CGCTGTGGGT CAATCCGGAG ATCAACGCCT CGGACGCCTT CGTCGCGGCC
TGGCTGCCGG GTTCGGAAGG CGGCGGGATC GCCGACGTGC TGATCGGCGA CAAGGCGGGA
AAGCCGCGCC ACGACTTCCG AGGCAAGCTG TCGTTCAGCT GGCCCAAGAC CGCCGGCCAG
TTCACGCTGA ACCGCGGCGA CAAGCGCTAC GACCCGCAGT TCGCCTATGG CCACGGCCTG
ACCTACGCCT CCAAGGTCCG TGTGGGGACG CTGTCGGAGA AGCCTGGCCT CACCGTGGCG
GCCGAGAACG TCAGCAACTA CTTCGTGGCC GGCAAGACGC CCGCGCCCTA TGAGTTCAGG
CTGACCCCGA CCACGGCCGT GCAGGTCCGT CCGGTGGACG CCGGCAACGT GCAGGAGGCC
GGTCGCCAGA TCACCTTCTC GGGCGACGTC CCGGCGACGG CCGCGATCTC GGGCGACCAG
GCCGACCTGA CGTTCCAGAC CAATGCCGAG ATGAGCCTGC TGATCGACTA TCGCCTCGAC
GCCAAGCCGA CCGGTCCGGT GACCCTGGCG ATCGGCAGGG GCAAGGTGGA CGTGACGCCG
GTGCTCAACG CTTCGCCGGT CGGCGAGTGG AAGAGCTTGA AGGTCCCGCT CAAGTGCTTC
CAGGCGGCGG GAACCGACGT GACCAAGGTC ACCGCGCCGT TCGAACTGTC GACCGCCGGC
AAGCTGACCG TGTCGCTGCA AGGGGCGAAG CTGAGCACCG ACCCGGCCGG GGCGACGTGC
CCAAGCAAGG CGGCGAACTA A
 
Protein sequence
MWGVMNTKAF CAALLATTLL STPFLNGAAL AADTKSTAHP ALWPAAKSQG VVDSQTEAFV 
DSLLAKLTLE EKVGQMIQGD IGSVKPEDLK TYPLGSILAG GSSPPLGAPD RSPIGPWVKS
VEAFRAAAAQ RQGGTRIPLM FGIDSVHGHG NAVGATLFPH NIGLGAARDP ELIRKIGAAT
AQETAASGFD WAFGPTLTVP RDDRWGRTYE GYSEDPEIVR SYAGQMILGL QGAVSQGGVI
QQGHVAASAK HFLGDGGTHD GKDQGDTQVS EADLIRLHAQ GYVPAVNAGT LTIMASFNSW
NGEKMHGNKS LLTDVLKGKM GFDGFIVGDW NGHGQVAGCT PTNCAQAANA GLDMYMAPDS
WKELYANTLA QAKSGEIPMA RIDDAVRRIL RVKAKLGLFQ QARPLEGKEA VMASADHRAI
ARQAVRESLV LLKNNGVLPV KASANILVAG SGADDIGQQA GGWTLSWQGT GNTKADFPNA
QSIYSGLKET VEASGGTATL SVDGAFDKKP DVAIVVFGET PYAEGVGDIR TLEFQPGTKT
DLALLKTLKA AGVPVVSVFL SGRPLWVNPE INASDAFVAA WLPGSEGGGI ADVLIGDKAG
KPRHDFRGKL SFSWPKTAGQ FTLNRGDKRY DPQFAYGHGL TYASKVRVGT LSEKPGLTVA
AENVSNYFVA GKTPAPYEFR LTPTTAVQVR PVDAGNVQEA GRQITFSGDV PATAAISGDQ
ADLTFQTNAE MSLLIDYRLD AKPTGPVTLA IGRGKVDVTP VLNASPVGEW KSLKVPLKCF
QAAGTDVTKV TAPFELSTAG KLTVSLQGAK LSTDPAGATC PSKAAN