Gene Caul_2341 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2341 
Symbol 
ID5899796 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2539837 
End bp2540883 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content67% 
IMG OID641562832 
Productalcohol dehydrogenase 
Protein accessionYP_001683966 
Protein GI167646303 
COG category[R] General function prediction only 
COG ID[COG1064] Zn-dependent alcohol dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAAAGA CAATGAAGGC GGCCGTGGTC CGCGCCTTCG GCCAGCCGCT CGTGATCGAG 
GAGGTGCGGG TTCCGCAGGT CGGCCCCGGC CAGATCCTGG TCAAGATCGC CGCCGCCGGG
GTTTGCCACA CCGATCTGCA CGCCGCGCAG GGCGATTGGC CGGTCAAGCC CAATCCGCCC
TTTATTCCAG GCCATGAGGG CGCCGGCCAT GTGGTCGCCG TGGGCGCGGG CGTCACCCAT
GTCCGGGAAG GAGACCGCGT CGGCGTGCCC TGGCTCTACT CCGCCTGCGG TCACTGCGTT
CATTGCCTCG GCGGCTGGGA GACCCTCTGC GAACTGCAGC AAAACACCGG ATATTCGGTG
AACGGCAGCT TCGCTGACTA TGTGCTCGCC GATCCCAACT ATGTCGGCCA CTTGCCCGAC
AACGTCGGCT TCGTCGAGAT CGCCCCCGTG CTGTGCGCGG GCGTCACGGT CTATAAGGGC
CTCAAGATGA CCGAGGCCAA GCCGGGCGAC TGGGTGGCCA TCTCTGGCGT CGGCGGCCTT
GGGCACATGG CTGTCCAATA CGCTAGAGCG ATGGGATTGA ACGTCGCCGC CGTCGATATC
GATGACCAGA AGCTGGCCCT GGCGCGCGCT CTTGGTGCGA CCGTGACCGT CAACGCGCTC
CACGCCGACC CGGTGGCGGT GCTCAAGAAG GAGATCGGCG GCGCCCACGG CGTCCTCGTG
ACCGCCGTCT CGCCAAAGGC CTTCGCCCAG GCGCTGGGCC TGGTGCGCAG AGGCGGCGCC
GTCGCCCTGA ATGGATTGCC GCCGGGGGAT TTCCCCCTCT CCATCTTCGA CACCGTGCTC
AACGGGATCA CCATCCGCGG TTCGATCGTC GGCACGCGGC TGGACCTGCA AGAGGCTCTG
GCCTTTGCCG GTGAGGGAAA GGTGCGCGCC ACCGTCTCGA CCGATCGGCT CGAGAACATC
AATGCAGTTT TCGACCGCAT GCGTCGCGGC GAGATCGAGG GCCGGGTCGT GCTCGACCTG
TCGGATGGAC CCCGCAGTGG TTCATGA
 
Protein sequence
MEKTMKAAVV RAFGQPLVIE EVRVPQVGPG QILVKIAAAG VCHTDLHAAQ GDWPVKPNPP 
FIPGHEGAGH VVAVGAGVTH VREGDRVGVP WLYSACGHCV HCLGGWETLC ELQQNTGYSV
NGSFADYVLA DPNYVGHLPD NVGFVEIAPV LCAGVTVYKG LKMTEAKPGD WVAISGVGGL
GHMAVQYARA MGLNVAAVDI DDQKLALARA LGATVTVNAL HADPVAVLKK EIGGAHGVLV
TAVSPKAFAQ ALGLVRRGGA VALNGLPPGD FPLSIFDTVL NGITIRGSIV GTRLDLQEAL
AFAGEGKVRA TVSTDRLENI NAVFDRMRRG EIEGRVVLDL SDGPRSGS