Gene Caul_0686 details

Gene Information

Locus tag: Caul_0686
Symbol:
ID: 5898141
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 749666
End bp: 750640
Gene Length: 975 bp
Protein Length: 324 aa
Translation table: 11
GC content: 69%
IMG OID: 641561168
Product: oxidoreductase
Protein accession: YP_001682317
Protein GI: 167644654
COG category: [I] Lipid transport and metabolism
    [Q] Secondary metabolites biosynthesis, transport and catabolism
    [R] General function prediction only
COG ID: [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases)
TIGRFAM ID:
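
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record; variable names are illustrative.
start_bp = 749666
end_bp = 750640
protein_length_aa = 324

# Assumption: coordinates are 1-based and inclusive.
gene_length_bp = end_bp - start_bp + 1

# Assumption: a complete CDS has one codon per residue plus one stop codon.
codons = gene_length_bp // 3
expected_protein_length = codons - 1

print(gene_length_bp)           # 975
print(gene_length_bp % 3)       # 0, so the length is a whole number of codons
print(expected_protein_length)  # 324
```

Under these assumptions the computed values agree with the listed gene length (975 bp) and protein length (324 aa).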


Plasmid Coverage information

Num covering plasmid clones: 36
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGACCGATC GCATCACGTC GCCCTTCGGG GCCTATACCG ACGCCCGCGA CGTGGTCGCC 
GGCCATGACC TGACCGGCAA GGTCGCCATC GTCACCGGCG GCGCCACCGG CATCGGGATC
GAGACCGCCC GCGCCCTGGC CCAGGCCGGG GCCGAGGTGG TGATCGCCGT CCGCAAGCCC
GACCTCGCCG AGGCCGCCGT GGCCGAGATC AACAAGACCG CCAAAGGCGC CAAGGCGAGC
TGGTCGATGC TGGACCTCGC CAGTTTCAAG TCGATCCGCG CCTTCGTGGA GCGCTGGGGC
GACCGGCCGC TGAACCTGCT GATCAACAAC GCCGGGGTCA TGGCCTGCCC GCTCGCCTAT
ACCGAGGACA GGCTGGAGAT GCAGATCGGG ACCAACCATT TCGGCCATTT TCTGCTGTCG
GTGCTTCTGG CCCCGAACCT GGTGGCGGGC GCCAAGGCTT CGGGCAAGGC CTCGCGCCTG
GTGTCGCTGT CGTCGATCGG TCACCGCCGC GCGCCGATGA ACTTCGATGA CCCGCATTTT
CGCTCGCACC CCTACGACAA GTGGGAAAGC TACGGCCAGG CCAAGACCGC CAACGCCCTG
TTCGCGGTCG GCTTCGACAA GCGCTTCAAG GACCAGGGCG TGCGCGCCTT CTCGGTGATG
CCGGGCGGCA TCATGACCCC GCTGCAGCGC CACTTGCCGA TCGAGGAACA GGTCGCCATG
GGCTGGATCG ACGAGCACGG CAAGGTCCGC GACGGCTTCA AGACGCCCCA GCAGGGGGCC
TCGACCAGCG TCTGGGCCGC CGTCGGCGAC GAGCTGGAGG GCGCCGGCGG GCTCTATCTG
GAAGACCTGG CCCAAGCCGC GCCGTGGACC AAGCAGTCCG GCTGGTCCGG CGTCATGCCC
CATGCCCTGG ATCCCGAGGC GGCCGACCGG CTCTGGACCC TGTCGGTCGA AACCACCGGC
GCGGGCGCGG CGTGA
 
Protein sequence
MTDRITSPFG AYTDARDVVA GHDLTGKVAI VTGGATGIGI ETARALAQAG AEVVIAVRKP 
DLAEAAVAEI NKTAKGAKAS WSMLDLASFK SIRAFVERWG DRPLNLLINN AGVMACPLAY
TEDRLEMQIG TNHFGHFLLS VLLAPNLVAG AKASGKASRL VSLSSIGHRR APMNFDDPHF
RSHPYDKWES YGQAKTANAL FAVGFDKRFK DQGVRAFSVM PGGIMTPLQR HLPIEEQVAM
GWIDEHGKVR DGFKTPQQGA STSVWAAVGD ELEGAGGLYL EDLAQAAPWT KQSGWSGVMP
HALDPEAADR LWTLSVETTG AGAA
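
The sequences above are printed in blocks of 10 characters, 60 per line, so the spaces and line breaks need to be removed before computing anything from them. The following is a minimal sketch of that cleanup, together with a GC-fraction calculation and a check that the nucleotide length matches the protein length. The function names are hypothetical, and the toy input is made up for illustration; it is not the Caul_0686 sequence.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def cds_length_matches(dna: str, protein: str) -> bool:
    """True if the DNA has one codon per residue plus one stop codon."""
    return len(dna) == 3 * (len(protein) + 1)


# Toy input (ATG GCC GGC TGA encodes M, A, G, stop); not the record's sequence.
toy_dna = clean_sequence("ATGGCC GGCTGA")
toy_protein = clean_sequence("MAG")

print(len(toy_dna))
print(gc_fraction(toy_dna))
print(cds_length_matches(toy_dna, toy_protein))
```

Pasting the gene and protein sequences from this record into `clean_sequence` should allow the same checks against the listed 975 bp, 324 aa, and 69% GC content.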