Gene Caul_1687 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1687 
Symbol 
ID5899142 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1774798 
End bp1775766 
Gene Length969 bp 
Protein Length322 aa 
Translation table11 
GC content65% 
IMG OID641562177 
ProductNAD-dependent epimerase/dehydratase 
Protein accessionYP_001683314 
Protein GI167645651 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.580617 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGACG GGGTCCGGAC CGGGCCGGTG GTCGCCGTGA CGGGCGCCAC GGGCTTTCTG 
GGCCGACGCC TGGTCAGGAT TCTGGCCGAA GAGGGCTGGA CCGTCCGCGT CCTGGCCCGG
CGCGACATCG CCGATCCGGC ATGGCGCGGT CTCGAACTGC AACTGGCGAT CGGCGACCTG
GCCAACCCCC GCGCCCTGGC CGCGCTGTGC GACGGAGCCG AAACCGTCAT TCACGTGGCC
GGCCTGATCA AGGCGCGTTC CCGCGCGGTG TTCGACAAGG CCAACGTCGA GGGCTCGCGA
CAGGTCGCCC TGGCGGCCAA GGCGGCCGGC GCGCGACTCG TCCTGGTCTC CAGCCTCGCG
GCGCGCGAGC CTCATCTGTC GGACTACGCC GGCAGCAAGC GTGGCGGAGA AGACGCAGCG
CGCGAGATTT TTGGCGCAGA TCTGACCATT GTCCGCCCGC CGGCCATCTA CGGTCCGGGC
GACATCGAGA CGCTCAGACT GTTCAAAATG GCCTCGGAAG GGGCCTTTTT GCCGGTTTTG
GACCCGAAAG CCCGCATGGC CTGGATCCAT GCCGACGACG CCGCCGCCCG CATCGCCGCC
CTGGTCAAAA CGCCGCGCCC CGGGCTGCTC AGCTTGTCCG ACGACCGTCC CGAAGGCTAT
GGCTGGGTCG AATTGATGCA GGCTGCGTCC AAGGCGGTCG ACGCATCGCC GAGACTGGTT
CGAATCCCCT CGTGGACGAT CAAGGCTTTG GCAAACCTGT CAAAATGGGC TGCAATCGCC
ACCGGAAACG ACTCAATTCT CACCCCCGGC AAGGCGCGGG AGTTGCTTCA CGGCGATTGG
AGCCTATCTA GCAACGATCC GATTCCGGAC TTTCCCCCGG TGAGATACTC TCTCGAGGCG
GGATTCGCGC AAAGCGTGCG CTGGTATCGT TCGGAAGGTT GGTTGAAGGA TAAAAAATCT
CGAAAATAG
 
Protein sequence
MDDGVRTGPV VAVTGATGFL GRRLVRILAE EGWTVRVLAR RDIADPAWRG LELQLAIGDL 
ANPRALAALC DGAETVIHVA GLIKARSRAV FDKANVEGSR QVALAAKAAG ARLVLVSSLA
AREPHLSDYA GSKRGGEDAA REIFGADLTI VRPPAIYGPG DIETLRLFKM ASEGAFLPVL
DPKARMAWIH ADDAAARIAA LVKTPRPGLL SLSDDRPEGY GWVELMQAAS KAVDASPRLV
RIPSWTIKAL ANLSKWAAIA TGNDSILTPG KARELLHGDW SLSSNDPIPD FPPVRYSLEA
GFAQSVRWYR SEGWLKDKKS RK