Gene Caul_2140 details


Gene Information

Locus tag: Caul_2140
Symbol:
ID: 5902586
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 2313953
End bp: 2316199
Gene Length: 2247 bp
Protein Length: 748 aa
Translation table: 11
GC content: 69%
IMG OID: 641562630
Product: Beta-glucosidase
Protein accession: YP_001683766
Protein GI: 167646103
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1472] Beta-glucosidase-related glycosidases
TIGRFAM ID:
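
The start, end, and length fields in this record are consistent with one another, and the arithmetic can be checked with a short sketch. It assumes the coordinates are 1-based and inclusive and that the gene length includes the stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2313953
END_BP = 2316199


def gene_length(start_bp, end_bp):
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp, includes_stop_codon=True):
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default, that the
    final codon is a stop codon that contributes no residue.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if includes_stop_codon else codons
```

With the values above, `gene_length(START_BP, END_BP)` gives 2247 and `protein_length(2247)` gives 748, matching the Gene Length and Protein Length fields.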


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.0303148
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0291608
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACCAAC AGACCTGGCG GGGCGTGACC CTGGCCCTGA TGCTCGGCGC ATCCTCCTGC 
GCCCTGGCGC CCGCCGCCTG CGCCCAGGCG CCCGCGCCCG CCGCCCGTCC GTGGCTTGAT
CCCAAGCTGG GCGCCGACAC GCGCGCCGAC CTGGCGCTCA AGGCCATGAC CCAGGACGAG
AAGCTGACGA TCATCTTCGG CTATTTCGGC GCCGACATGG CCCCCAAGTA TAAGCGCGTG
GCCGACGCCC TGCCCGGCTC GGCCGGCTAT GTGCCGGGGA TCGCGCGCCT GGGCATTCCC
GCCCAGTTCC AGACCGACGC CGGCGTCGGC GTGGCCACCC AGGGGGGCGA GCCCAACAAG
CGCGAACGCA CCGCCCTGCC CTCCGGCATG GCCACCGCCG CGACCTGGAA TCCGAAACTG
GCCCAGGCCG GCGGCGCGAT GATCGGCGCC GAGGCCCGCT CGTCCGGCTT CAACGTCATG
CTGGCCGGCG GCGTGAACCT GGTCCGTGAA CCGCGCAACG GCCGCAACTT CGAATATGGC
GGCGAGGATC CGTGGCTGGC CGGCCAGATG GTCGGGGCCC AGATCAAGGG CATCCAGTCC
AACGCCATCA TCTCGACCAT CAAGCACTAC GCCCTGAACG GCCAGGAGAC CGGCCGCTTC
GTGCTGGACG CCAAGATCGG CGAAGGCGAG GCCCGCACCT CCGACCTGCT GGCCTTCCAG
TTCGCCACCG AGATCGCGGA CCCCCACTCG GTGATGTGCG CCTACAACAA GGTCAATGGC
GACTACGCCT GCGAGAACGA TTTCCTGCTC AACAAGGTTC TCAAGCAGGA CTGGGCCTAC
AAGGGCTATG TGATGTCGGA TTGGGGCGCG CACCATTCCA GCGCCAAGGC CGCCAATGCG
GGCCTGGACC AGGAATCGGC CGGCGACGCC TTCGACAAGC AGCCCTTCTT CAAGGGTCCG
CTGAAGGACG CCCTGGCCAA GGGCGAGGTG TCCCAGGCCC GGCTCGACGA CATGGCCCGC
CGCATTCTGC GCAGCCTGTT CGCCAGCGGC GTGGTCGAAA AGCCGGTGAA GATCGAGACC
ATCGACTACG CCGCCCACGC CAAGGTCACG CAGGCCGACG CCGAGGAAGG CATTGTCCTG
CTGAAGAATG ACAAGGGCCT GCTGCCGCTC GTCGCCAGCG CCAAGAAGAT CGTCGTGATC
GGCGGCCACG CCGATGTCGG CGTGCTGTCG GGCGGCGGCT CCTCGCAGGT CTATCCGATC
GGCGGCCGCG CGGTGCAGGG CGAAGGTCCC GCCACTTGGC CGGGTCCGAT GATCTACTTC
CCCTCCTCGC CCCTGAAGGC GCTGAAGGCC CGCCTGCCGG GCGCCGACAT CCAGTACATC
AACGGCAAGG ACAAGGCCGC CGCCGCCAAG CTGGCGACCG GCGCCGACGT GGTGCTGGTG
TTCGCCACCC AATGGAACGG CGAGTCGTTC GACAGCCCGC TGACTCTGGA AAACGACCAG
GACGCCCTGA TCGACGCCGT CGCCTCAGCC AACGCCAAGA CCGTGGTGGT GCTGGAGACC
GGCGGCCCGG TGCTGATGCC CTGGCTCGAC AAGGTGGGCG GCGTGGTCGA GGCCTGGTAT
CCCGGTTCGG AAGGCGGCGA GGCCATCGCC CGCGTGCTCA CCGGCGAGGT CGACGCCTCG
GGCCGCCTGC CCGTCACCTT CCCCGCCGCC CTGGCCCAGT TGCCGCGTCC GGTGCTGGAC
GGCGACCCCA AGAAGCCCGA CGACAGCTTC CCGGTCGACT ATACGATCGA GGGCGCGACG
GTCGGCTACA AGTGGTTCGA CAAGAAGGGC CAGCAACCGC TGTTCGCGTT CGGCCACGGC
CTGTCCTACA CCAGCTTCGC CTACGCCAAC CTCAAGGCCG AGGCCCGGAA CGGCGCCCTG
ACCGTCAGCT TCGACGTCAG GAACACCGGC CGGCGAACCG GCAAGGCCGT GCCGCAGGTC
TATGTCTCGC CCAAGGCCGG GGGATGGGAA GCGCCTCAGC GTCTGGCCGC CTTCAGCAAG
GTCGAGCTGG CGCCCGGCGC GACCCAGAAC GTCACCCTGA CCATCGATCC GCGCCTGCTG
GCCGCCTGGG ACGACAAGGC CCACGGCTGG TCGATCGCGG CTGGCGACTA CACCGTCACC
CTGGGCGCTT CGTCACGCGA CACCGCCGCC AAGGCGGACG TCGCGGTGGC GGCCCGGACC
GTTCCTGTCG GCCTGATGAA GCCCTGA
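
The gene sequence is printed in blocks of 10 bases, so the spaces and line breaks have to be removed before any computation. The sketch below shows one way to do that and to compute a GC fraction that can be compared with the GC content field; the helper names are hypothetical, and the stop-codon set is the standard TAA/TAG/TGA set, which translation table 11 also uses.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(seq):
    """Fraction of bases in seq that are G or C (0.0 to 1.0)."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


def ends_with_stop(seq):
    """True if seq is a whole number of codons and its last codon is a stop."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS
```

Passing the full gene sequence through `clean_sequence` and then `gc_content` gives a fraction to compare against the listed GC content; `ends_with_stop` checks that the cleaned sequence terminates in a stop codon, as the final `TGA` above suggests.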
 
Protein sequence
MNQQTWRGVT LALMLGASSC ALAPAACAQA PAPAARPWLD PKLGADTRAD LALKAMTQDE 
KLTIIFGYFG ADMAPKYKRV ADALPGSAGY VPGIARLGIP AQFQTDAGVG VATQGGEPNK
RERTALPSGM ATAATWNPKL AQAGGAMIGA EARSSGFNVM LAGGVNLVRE PRNGRNFEYG
GEDPWLAGQM VGAQIKGIQS NAIISTIKHY ALNGQETGRF VLDAKIGEGE ARTSDLLAFQ
FATEIADPHS VMCAYNKVNG DYACENDFLL NKVLKQDWAY KGYVMSDWGA HHSSAKAANA
GLDQESAGDA FDKQPFFKGP LKDALAKGEV SQARLDDMAR RILRSLFASG VVEKPVKIET
IDYAAHAKVT QADAEEGIVL LKNDKGLLPL VASAKKIVVI GGHADVGVLS GGGSSQVYPI
GGRAVQGEGP ATWPGPMIYF PSSPLKALKA RLPGADIQYI NGKDKAAAAK LATGADVVLV
FATQWNGESF DSPLTLENDQ DALIDAVASA NAKTVVVLET GGPVLMPWLD KVGGVVEAWY
PGSEGGEAIA RVLTGEVDAS GRLPVTFPAA LAQLPRPVLD GDPKKPDDSF PVDYTIEGAT
VGYKWFDKKG QQPLFAFGHG LSYTSFAYAN LKAEARNGAL TVSFDVRNTG RRTGKAVPQV
YVSPKAGGWE APQRLAAFSK VELAPGATQN VTLTIDPRLL AAWDDKAHGW SIAAGDYTVT
LGASSRDTAA KADVAVAART VPVGLMKP
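
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The following is a minimal sketch of that translation, not the pipeline used to produce this record. It builds the codon table from the conventional TCAG ordering, whose amino acid assignments are shared by the standard code and translation table 11; it does not model the alternative start codons of table 11, which is harmless here because the gene starts with ATG. The function name `translate` is illustrative.

```python
from itertools import product

# Codon table in the conventional TCAG order (standard code; table 11
# uses the same amino acid assignments). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq):
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Expects an uppercase sequence with no whitespace; trailing bases that
    do not form a full codon are ignored.
    """
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)
```

For example, the first seven codons of the gene sequence, `ATGAACCAACAGACCTGGCGG`, translate to `MNQQTWR`, and the last five, `CTGATGAAGCCCTGA`, translate to `LMKP`, matching the start and end of the protein sequence above.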