Gene Caul_1066 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1066 
Symbol 
ID5898521 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1126997 
End bp1127962 
Gene Length966 bp 
Protein Length321 aa 
Translation table11 
GC content72% 
IMG OID641561548 
ProductNAD-dependent epimerase/dehydratase 
Protein accessionYP_001682694 
Protein GI167645031 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGTCC TGGTCCTGGG GGGAACAGGT TTCATCGGCG CGCCGCTGGT CGCGCGACTG 
CTGGCCGACG GTGTCGAGAC GGCCGTCGCC CACCGTGGTG CGCGCCCGCT CCCGGCTGGG
GCGACCTTGG TGACGCTCGA CCGCCGCGAC CCGGCGGCGG TGCTGGCGGC GGTTCGCGAC
CTGGGCGCCG ACACGGTGAT CGACCTGCTG GCCTATACGG CGGCCGACAC CCTGCCGCTG
CTGGACGCCC TGTCCGGCCA GATCGCCCGC TATGTGATGG TCAGCTCGGT CGACGTCTAC
GCCAATTACG AGGGCCTGCA TCGCAAGGGC CGGCCGACGC CGGTCTGGGA CCGGCTGACC
GAGGACGCGG CCCTGCGGAC CAGCCGCTAT CCCTACCGCC TGGCCAAGCC CCGGGCGGCG
AGCGACCCCC AGGCCTGGAT GGACGACTAC GACAAGATCC CGCTCGAGGA GGCCGCACGT
GAACGCCTGG GCGACGCCGC GACCATCCTG CGCCTGCCGA TGGTGTTCGG ACCCGGCGAC
CGCCAGCGGC GGTTCTCCTG GGCGATCCGC CCTATGGTCC AGGGCCGGCC GCGCTTCGTG
ATTCCCCATC CGTGGGCCAG CTGGCGGGCG ACCTTCGGCT ATGTCGATGA CGTGGCGGCC
GGGATCGCCC TGGCCGCCGT CCAGCCCCGG GCCGGCGGCG AGACCTACAA TCTGGGCCGC
GCCAACACCC CGACCAATAT CGCCTGGGCC GTCGCCTTCG CCGAGCATCT GAACTGGCCC
GGCGAGGTGC AACTGGCCCA TCCGGACGTG GCCCGAGGGG CCCTGGCGGC GGCGACGGCG
GGACTGGACC TCAGCTATCC GCTGTTCATC GACAACGCCA AGATCCGCCG GCGGCTGGGC
TATGCGGAGG TCACCGACTT CGACGAGGCC CTGGCCCGGA CGGTGGCGGA CGAGATGGGG
CGGTAG
 
Protein sequence
MSVLVLGGTG FIGAPLVARL LADGVETAVA HRGARPLPAG ATLVTLDRRD PAAVLAAVRD 
LGADTVIDLL AYTAADTLPL LDALSGQIAR YVMVSSVDVY ANYEGLHRKG RPTPVWDRLT
EDAALRTSRY PYRLAKPRAA SDPQAWMDDY DKIPLEEAAR ERLGDAATIL RLPMVFGPGD
RQRRFSWAIR PMVQGRPRFV IPHPWASWRA TFGYVDDVAA GIALAAVQPR AGGETYNLGR
ANTPTNIAWA VAFAEHLNWP GEVQLAHPDV ARGALAAATA GLDLSYPLFI DNAKIRRRLG
YAEVTDFDEA LARTVADEMG R