Gene Caul_2918 details


Gene Information

Locus tag: Caul_2918
Symbol:
ID: 5900373
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 3163903
End bp: 3164904
Gene Length: 1002 bp
Protein Length: 333 aa
Translation table: 11
GC content: 65%
IMG OID: 641563415
Product: ADP-L-glycero-D-manno-heptose-6-epimerase
Protein accession: YP_001684543
Protein GI: 167646880
COG category: [G] Carbohydrate transport and metabolism; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0451] Nucleoside-diphosphate-sugar epimerases
TIGRFAM ID: [TIGR02197] ADP-L-glycero-D-manno-heptose-6-epimerase
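
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 3163903
end_bp = 3164904

# Inclusive coordinates (assumption): both endpoints count toward the length.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be the stop codon,
# which does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, codon_count, protein_length)
```

Under these assumptions the computed values agree with the listed gene length of 1002 bp and protein length of 333 aa.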


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0348958
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGTCCAGAC GCGTCATCCT CGTCACCGGC GGCGCCGGGT TCATCGGCTC CAATATCGTG 
GCCAAGCTGT GCGAGGACCA GAGTCTCGAC GTGGTGGTCT GCGATTGGCT GGGCGAGGCC
GCGGGGAACA AGTGGAAGAA CCTCGCCAAG TCGCCGATCG CCGACTTCGT GGCCCCGGAA
GATCTGTTCG ACTGGCTGGC CGAGCGCGGC GAAGAGGTCG AGCTGGTGAT CCACATGGGG
GCGATCTCGT CGACCACCGA GCCGGACGCC GACCTGATCG TGCAGTCCAA CTTCGTGCTG
TCGCGCGACC TGTGGCGGTG GTGCGCTCAA CATCAACGCC GGCTGATTTA TGCTTCTTCC
GCCGCCACTT ACGGCGATGG CCTGCAAGGC TTCGACGGCA AGGACGACCT GGAAAGCCTC
AAGGCCCTGC GGCCGCTGAA CGCCTATGGC TGGTCCAAGG CCTTGTTCGA CATCTACGCC
GTCCGCGAGG CCGCCCGTGG CCACGCCCCG ATCCAGTGGG TGGGCCTGAA GTTCTTCAAC
GTCTATGGCC CCAACGAGGA CCACAAGGCC GACATGAAGT CGGTGGTCTG TCAGATCTGG
CCCAAGGTGG CGGCGGGCGA GGGCGTGCGC CTGTTCAAGT CGCACCACCC CGACTACGAG
GATGGCGGCC AGCTGCGCGA TTTCGTCTAT GTGCGCGATG TGGCCGACGT GGTCGCCTGG
CTGGCCAAGA GCCCGCACGT CAACGGCGTC TACAATCTGG GCTCGGGCAA GGCTCGCTCA
TTCAAGGCCC TGGCCGAAGC TACCTTCCGC GCCGTGGGCC GCGAGCCCGA CATCGACTAT
TTCGACATGC CCGAAGTGCT GCGCGGCAAG TACCAGTACT ACACGCAAGC CGACATGAGC
CGGCTGCGCG CCGCCGGCTA CAACGCGCCG ATGACCACGC TGGAGGACGG CGTGGAGGAC
TATGTGGCGG GGTTCCTCAA TACGGATGAT CCGTATCGGT GA
 
Protein sequence
MSRRVILVTG GAGFIGSNIV AKLCEDQSLD VVVCDWLGEA AGNKWKNLAK SPIADFVAPE 
DLFDWLAERG EEVELVIHMG AISSTTEPDA DLIVQSNFVL SRDLWRWCAQ HQRRLIYASS
AATYGDGLQG FDGKDDLESL KALRPLNAYG WSKALFDIYA VREAARGHAP IQWVGLKFFN
VYGPNEDHKA DMKSVVCQIW PKVAAGEGVR LFKSHHPDYE DGGQLRDFVY VRDVADVVAW
LAKSPHVNGV YNLGSGKARS FKALAEATFR AVGREPDIDY FDMPEVLRGK YQYYTQADMS
RLRAAGYNAP MTTLEDGVED YVAGFLNTDD PYR
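
The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few in-frame blocks. The sketch below is a hedged illustration, not the database's own method: it uses the codon-to-amino-acid assignments that NCBI translation tables 1 and 11 share, and the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical. Only the first and last lines of the gene sequence are used here.

```python
# Codon-to-amino-acid assignments shared by NCBI translation tables 1 and 11,
# listed with each codon position cycling through T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(seq):
    """Remove the whitespace that separates the 10-base blocks and uppercase."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        first, second, third = (BASES.index(base) for base in dna[i:i + 3])
        amino_acid = AMINO_ACIDS[16 * first + 4 * second + third]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGTCCAGAC GCGTCATCCT CGTCACCGGC GGCGCCGGGT TCATCGGCTC CAATATCGTG"
last_line = "TATGTGGCGG GGTTCCTCAA TACGGATGAT CCGTATCGGT GA"

print(translate(first_line))  # start of the protein
print(translate(last_line))   # end of the protein, up to the stop codon
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow a comparison against the listed protein sequence and GC content. Ambiguous bases such as N are not handled and would raise a `ValueError`.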