Gene Caul_2449 details

Gene Information

Locus tag: Caul_2449
Symbol:
ID: 5899904
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 2671288
End bp: 2672283
Gene Length: 996 bp
Protein Length: 331 aa
Translation table: 11
GC content: 70%
IMG OID: 641562940
Product: zinc-binding alcohol dehydrogenase family protein
Protein accession: YP_001684074
Protein GI: 167646411
COG category: [R] General function prediction only
COG ID: [COG1064] Zn-dependent alcohol dehydrogenases
TIGRFAM ID: [TIGR02822] zinc-binding alcohol dehydrogenase family protein
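
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table above.
start_bp = 2671288
end_bp = 2672283


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 996 331
```

Both results agree with the Gene Length (996 bp) and Protein Length (331 aa) fields listed in the record.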


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value: 0.185569
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGGCGA TGGTTCTGGA GGGCGGCCGG CTGACGCCCG CCCTGCGTCA CACGCCAAGG 
CCCGGCCCGG GCGAAGTCCT GCTCCGCGTG CGGACTTGCG GCGTCTGCCG CACGGATCTT
CATCTCCTGG AAGGCGATCT TCCTGTTCAG GACGGCGTCA TTCCGGGGCA CGAGATCGTC
GGCGTCGTGG AGGCGCTCGG GCAGGGCGTC ACGACTCTCG ACCTGGGGCA ACGGGTCGGC
GTCCCGTGGC TCGGCGGGGC CTGCGGGCGT TGCCGATTCT GTCGCCAGGG CGCGGAAAAC
CTCTGCGATC ACGCGCGGTT CACCGGCTGG ACGCGCGACG GCGGCTATGC CGAAATGACA
GTCGCGGATG CGCGTTTCTG CTTCGTCCTG CCCGATGAGC TCGGCGATCT TGAGGCCGCT
CCGCTGCTGT GCGCCGGTCT GATCGGCTTT CGTGCGTGGC GCAAGGCGAT GGAGGGGCGG
GTCGTTGATC GACTGGGTCT CTATGGCTTT GGCGCCGCCG CCCACCTCCT GGCCCAACTG
GCGATCGCCG AGGGGCAAAA AATCTACGCC TTTACCAAGC CGGGTGATCT GGCCGCGCAG
GATCTGGCGT TGGAACTGGG CTGCCTGTGG GCGGGGGCGT CGGACGTCGC GCCGCCCGAA
CCGCTGGACG CAGCCATCCT GTTCGCGCCG ATCGGCGCGC TCGTGCCGCT CGCCTTGCGG
GCGGTCCGCA AGGGCGGCGC CGTGGTGTGC GCGGGAATCC ACATGAGCCA GATCCCGGCC
CTGGACTATG CCGACCTTTG GGGCGAGCGG ACCCTGGTCT CGGTGGCCAA TCTCACGCGC
GCCGACGCCC AGGACTATCT GCCGCGCGCC GCCGCCGCGG GCGTTCGCCC GCACGTCAAG
GTCTACGGTC TGCGGCAGGC CCCTCAGGCG CTCGCCGACT TGCGCGCCGG CGCCTTCACG
GGGGCGGCTG TGCTGCGGAT CGATCCGCCG CTGTGA
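
The gene sequence above is printed in groups of ten bases, so it needs to be joined before any calculation such as GC content. The following is a minimal sketch of that cleanup, shown on the first line of the sequence only. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the GC fraction is assumed here to be a simple count of G and C over total length, which may differ from how the source database derives its reported value.

```python
def clean_sequence(block: str) -> str:
    """Join a sequence printed in whitespace-separated groups across lines."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (simple count, an assumed definition)."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence above; paste the full block to analyse the whole gene.
first_line = "ATGCTGGCGA TGGTTCTGGA GGGCGGCCGG CTGACGCCCG CCCTGCGTCA CACGCCAAGG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the complete sequence block to `clean_sequence` should give a string whose length can be compared against the Gene Length field.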
 
Protein sequence
MLAMVLEGGR LTPALRHTPR PGPGEVLLRV RTCGVCRTDL HLLEGDLPVQ DGVIPGHEIV 
GVVEALGQGV TTLDLGQRVG VPWLGGACGR CRFCRQGAEN LCDHARFTGW TRDGGYAEMT
VADARFCFVL PDELGDLEAA PLLCAGLIGF RAWRKAMEGR VVDRLGLYGF GAAAHLLAQL
AIAEGQKIYA FTKPGDLAAQ DLALELGCLW AGASDVAPPE PLDAAILFAP IGALVPLALR
AVRKGGAVVC AGIHMSQIPA LDYADLWGER TLVSVANLTR ADAQDYLPRA AAAGVRPHVK
VYGLRQAPQA LADLRAGAFT GAAVLRIDPP L