Gene Caul_1832 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1832 
Symbol 
ID5899287 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1939481 
End bp1941907 
Gene Length2427 bp 
Protein Length808 aa 
Translation table11 
GC content68% 
IMG OID641562322 
Productglycoside hydrolase family 3 protein 
Protein accessionYP_001683459 
Protein GI167645796 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0261035 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAGCC AGACCGCCGT CACCCGTCGT ACCCTGGGCG TTAGCCTGGC CGCCCTGGCC 
GCCGCGTCCT CGACCCTGGC CCGCGCCGCC GCGCCGCCCA AGGCTGGAAA GGTCGAAAGG
GCGCTCTACA AGGATCCGAC CCAGTCGATC GAACTGCGGG TGCGCGACCT GCTCTCGCGC
ATGACGCTGG AAGAGAAGGC CGCCCAGCTG GTCGGCATCT GGCTCACCAA GGCCAAGATC
CAGACCCCGA ACGGCGACTT TTCGCCCGAA GAGGCCAGCA AGAACTTCCC CGACGGCCTG
GGCCAGATCT CTCGCCCAAC CGACCGCCGC GGCCTGAAGC CCGCCACGGT CGTGGGCGCC
GCCGCCGGCG CCGAGGACGG CTCGATCGGC CGCAACGCCA AGGAGACCGC CCGCTACATC
AACGCCGCCC AGAAGTGGGC GATGGAGAAG ACGCGCCTGG GCGTTCCGAT GCTGATGCAC
GACGAGGCCC TGCACGGCTA TGTGGCCCGC GACGCCACCA GCTTCCCTCA GGCCATCGCC
CTGGCCTCGA CCTTCGATAC CGAGATGACC GAGAAGGTCT TCGCGGTCGC CGCCCGCGAG
ATGCGCGCCC GGGGCTCGAA CATCGCCCTG GCCCCGGTGG TCGACGTGGC CCGCGACCCG
CGCTGGGGCC GCATCGAGGA GACCTACGGC GAGGACCCGC ACCTGTGCGC CGAGATCGGC
CTGGCGGCGA TTCGCGGCTT CCAGGGCAAG ACTCTGCCGC TGGCGCCCGA CAAGGTGTTC
GTCACCCTCA AGCACATGAC CGGCCACGGC CAGCCCGAGA ACGGCACCAA TGTCGGCCCG
GCCCAGATCG CCGAGCGCAC CCTGCGCGAA AACTTCTTCC CGCCGTTCGA ACGCGCGGTG
AAGGAGCTGC CCGTTCGTTC CGTCATGCCC TCGTACAACG AGATCGACGG CGTCCCGTCG
CACGCCAACC GCTGGCTGCT GACCGACATC CTGCGCAAGG AGTGGGGCTA CAAGGGTTCG
GTGCAGAGCG ACTATTTCGC GATCAAGGAA TTGATGGGCC GTCACAAGCT GACCGACGAC
CTGGGCGAGA CGGCCGTCAT GGCCATGAAC GCCGGCGTCG ATGTCGAGCT GCCGGACGGT
GAGGCCTACG CCCTGCTGCC CCAACTGGTG AAGGTCGGAC GCATCCCCCA GGCCGCCGTT
GACCAGGCCG TCGAGCGCGT CCTGACGATG AAGTTCGAGG GCGGCCTGTT CGAAAACCCC
TATGCCGACG AGAAGACGGC CGACGCCAAG ACCGCGACGC CGGACGCCAT CGCCCTGGCC
CGCGAGGCGG CCCGCAAGGC CGTGGTGCTG CTGAAGAACG ACAAGGGCGT GCTGCCGCTC
AATCCCTCGA AGTTCAAGCG CCTGGCCCTC TTGGGAACTC ACGCAAAGGA CACCCCGATC
GGCGGCTACA GCGACACGCC GCGCCATGTG GTGTCGATCT ACGAGGGCCT GCAGGCCGAG
GCCAAGAAGA GCGGCTTCAC GCTGGACTAC GCCGAGGCCG TGCGGATCAC GGAGGCCCGG
ATCTGGGCCC AGGACGAGGT CAAGCTGGTC GATCCGGCCG TCAACGCCAA GCTGATCGCC
GAGGCGGTGG AGGTGGCCAA GCAAGCCGAC GTCATCGTCA TGGTGCTGGG CGACAACGAG
CAGACCAGCC GCGAGGCCTG GGCCGACAAC CACCTGGGCG ACCGCGACAG CCTGGACCTG
ATCGGTCAGC AGAACGACCT GGCCAGGGCG ATCTTCGACC TGGGCAAGCC CACGGTGGTG
TTTCTGCTCA ACGGCCGCCC GCTGTCGATC AACCTGCTGG CGCAGCGCGC GGACGCCGTC
ATCGAGGGTT GGTACCTGGG GCAGGAAACC GGCAACGCCG CCGCCGACAT CCTGTTCGGC
CGCGCCAATC CGGGCGGCAA GCTGCCGGTC AGCATCGCCC GCGATGTGGG CCAGCTGCCG
ATCTACTACA ACCGCAAGCC CACGGCTCGC CGGGGTTACC TGCTGGGCGA CACCTCGCCG
CTCTATCCGT TCGGTTTCGG CCTGTCGTAC ACCACGTTCG ACATCTCGGC CCCGCGTCCG
GCCAAGGCCG AGATCGGCGC CAACGAGAGC GTCAAGGTCG AGATCGACGT GATCAACACC
GGCAAGGTCG CCGGCGACGA GGTGGTGCAG CTCTATATCC ACGACGAGGC CGCCTCGGTG
ACCCGTCCGG TGCTGGAGCT CAAGCACTTC AAGCGCGTGA CCCTGGCCCC CGGCGCCAAG
CAGACCGTGA CCTTCGAGGT CTCGCCGCTG GACCTGTCGC TGTGGAACCT GGAGATGAAG
CGCGTGGTCG AGCCGGGCAA GTTCACCCTG CTGTCGGGGC CCAATTCCGC GCAGTTGAAG
CCGGCGACGC TGACGGTCAT GGCTTAA
 
Protein sequence
MSSQTAVTRR TLGVSLAALA AASSTLARAA APPKAGKVER ALYKDPTQSI ELRVRDLLSR 
MTLEEKAAQL VGIWLTKAKI QTPNGDFSPE EASKNFPDGL GQISRPTDRR GLKPATVVGA
AAGAEDGSIG RNAKETARYI NAAQKWAMEK TRLGVPMLMH DEALHGYVAR DATSFPQAIA
LASTFDTEMT EKVFAVAARE MRARGSNIAL APVVDVARDP RWGRIEETYG EDPHLCAEIG
LAAIRGFQGK TLPLAPDKVF VTLKHMTGHG QPENGTNVGP AQIAERTLRE NFFPPFERAV
KELPVRSVMP SYNEIDGVPS HANRWLLTDI LRKEWGYKGS VQSDYFAIKE LMGRHKLTDD
LGETAVMAMN AGVDVELPDG EAYALLPQLV KVGRIPQAAV DQAVERVLTM KFEGGLFENP
YADEKTADAK TATPDAIALA REAARKAVVL LKNDKGVLPL NPSKFKRLAL LGTHAKDTPI
GGYSDTPRHV VSIYEGLQAE AKKSGFTLDY AEAVRITEAR IWAQDEVKLV DPAVNAKLIA
EAVEVAKQAD VIVMVLGDNE QTSREAWADN HLGDRDSLDL IGQQNDLARA IFDLGKPTVV
FLLNGRPLSI NLLAQRADAV IEGWYLGQET GNAAADILFG RANPGGKLPV SIARDVGQLP
IYYNRKPTAR RGYLLGDTSP LYPFGFGLSY TTFDISAPRP AKAEIGANES VKVEIDVINT
GKVAGDEVVQ LYIHDEAASV TRPVLELKHF KRVTLAPGAK QTVTFEVSPL DLSLWNLEMK
RVVEPGKFTL LSGPNSAQLK PATLTVMA