Gene Caul_0650 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_0650 
Symbol 
ID5898105 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp718229 
End bp719242 
Gene Length1014 bp 
Protein Length337 aa 
Translation table11 
GC content71% 
IMG OID641561132 
ProductNAD-dependent epimerase/dehydratase 
Protein accessionYP_001682281 
Protein GI167644618 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.349018 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.530155 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGCGA CCCAACAACA ACCCATCGCC CTGGTGATCG GCGCGGCCGG CGGCATCGGC 
GGTGAAGCCG CCGCCGCCCT GGCTCGCCAC GGCTGGACGG TCCGTGGCCT GACCCGCCGT
CCGATGCCGG CCCGGGCCGG GCTGGACTGG ATATCCGGCG ACGCCATGGT CGCCGCCGAC
GTGGCCCGCG CCGCCCAGGG CGTCTGCCTG ATCGTCCACG CCGCCAATCC GCCCGGCTAC
AAGAACTGGG GCAAGCTGGT GCTGCCGATG CTCGACAACA CCCTCGCCGC CGCCAAAGCT
AACGGCGCGC GGATCCTGTT GCCGGGCACG GTCTATAATT TCGGCCTGGA GGCCCTCCCG
CTGATCGACG AGACCGCGCC GCAAAACCCG ACCACCCGCA AGGGCAAGAT CCGGGTCGAG
ATGGAGCGGC GGCTGAAGGC GGCCTCCACG CAGGGAACAC CGGTGCTGAT CGTCCGGGCC
GGCGACTTCT TCGGGCCGCA CGCCGGCAAC AACTGGTTTG GCCAACTGGT GATCAAGCCG
GGCGCGCCGC TGAAGTCGGT GACCGAACCC TCGGCCAAGG GCCTGGCCCA CGCCTGGGCC
TATCTGCCGG ACTTGGCCGA GACCATGGCC CGGATCCTGG AGCGCGCCGA TCGCCTGAGC
GCCTTCGAGG TCTTCCATTT CGGCGGCCAC CTGCTGGCCT GGGGCGAGAT GGCGGCTTCG
GTGCGGCGGG TGACGGGCCA GCCAAAGTTG CCGGTGATGG GCTTCCCGTG GTGGCTGGTC
ATGGCGCTGT CGCCTGTGGT GCGGATCTTC GGGGAGATGG CCGAGATGCG CTACCTGTGG
CGCGAGCCGC TGGTGCTGGA TGATCGCAAG CTGCGGGCGT TCCTGGGCGA CGTGCCGCAC
ACGCCGCTGG ACCGGGCGGT GGAGGCCAGC CTGCGGGCGC TGGGGTGTTT GCCGATCGCC
AACCCTCTCC CGGCGGGAGA GGGAGGGACC CGCCGCGTAG CGGTGGGAGG GTGA
 
Protein sequence
MTATQQQPIA LVIGAAGGIG GEAAAALARH GWTVRGLTRR PMPARAGLDW ISGDAMVAAD 
VARAAQGVCL IVHAANPPGY KNWGKLVLPM LDNTLAAAKA NGARILLPGT VYNFGLEALP
LIDETAPQNP TTRKGKIRVE MERRLKAAST QGTPVLIVRA GDFFGPHAGN NWFGQLVIKP
GAPLKSVTEP SAKGLAHAWA YLPDLAETMA RILERADRLS AFEVFHFGGH LLAWGEMAAS
VRRVTGQPKL PVMGFPWWLV MALSPVVRIF GEMAEMRYLW REPLVLDDRK LRAFLGDVPH
TPLDRAVEAS LRALGCLPIA NPLPAGEGGT RRVAVGG