Gene Caul_4011 details


Gene Information

Locus tag: Caul_4011
Symbol:
ID: 5901473
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 4342373
End bp: 4343269
Gene Length: 897 bp
Protein Length: 298 aa
Translation table: 11
GC content: 72%
IMG OID: 641564532
Product: short-chain dehydrogenase/reductase SDR
Protein accession: YP_001685634
Protein GI: 167647971
COG category: [I] Lipid transport and metabolism
              [Q] Secondary metabolites biosynthesis, transport and catabolism
              [R] General function prediction only
COG ID: [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases)
TIGRFAM ID:
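
The gene and protein lengths in the record above follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein (the variable names are illustrative, not part of the record):

```python
# Values copied from the Gene Information fields.
start_bp = 4342373
end_bp = 4343269

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)  # 897 298
```

Both results agree with the Gene Length and Protein Length fields.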


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.0205558
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGCTGC TGGATGGACA TGTCGCGATC GTCACGGGGG CGGGCGGGGG ACTGGGCCGC 
GCCCATGCGC TCTATCTCGC CAGCCAGGGC GCGCGGGTGG TGGTCAACGA CCTGACCCAG
GACGCGGCCG ACCGCGTGGC CGCCGAGATC ACGGCGCGCG GGGGCCAGGC GATCGGCGTC
GCGGCCTCGG TCACCGACGA GGCGGCGGTC GGCGCGATGG TCCGCCAGGT CATCGACCGC
TGGCGGCGGA TCGACATCCT GGTCAGCAAC GCCGGCATCC TGCGCGACAA GAGCTTCGCC
AAGATGAGCC TGGACGACTT CCGCCTGGTG GTCGACGTCC ACCTGATGGG CGCGGTGGTC
TGCGCCAAGG CGGTGTGGGA CGTCATGCGC GAGCAGCGCT ACGGCCGCAT CGTGATGACG
ACCTCCTCGT CGGGCCTCTA CGGCAATTTC GGCCAGGCCA ACTACGGCGC GGCCAAGATG
GCCCTGGTGG GGCTGATGCA GACCCTGGCC ATCGAGGGCG AGAAGTACGG CGTTCGCGTC
AACTGCCTGG CTCCCACGGC GGCGACCGGC ATGACCGAGG GCGTGTTGTC GCAGGCCAGT
CTCGAACGCC TCGATCCCAC CCTGGTCAGC CCGGGCCTGC TGGCCCTGGT GGTCGAGGGC
GCCCCGACCC GGGCCATCCT GTGCGCCGGC GCCGGCCACT TCGCCACCGC CAACATCACC
TTGACCGAAG GCCGCTATGT CGGCGACGCT CCCGACGCGG GCGAGCAGGT GATCCGCCAA
TGGGAGGCGG TTTCCGAGCG GGCCGGCGAG ATCGTCCCGG CCTACGGTTT CGCCCAGGCC
GAGCGCGAGC TGGCCAGCGC CGGCCTGATC GCCGCCGTCG CCGCGGAGCG GGCGTGA
 
Protein sequence
MLLLDGHVAI VTGAGGGLGR AHALYLASQG ARVVVNDLTQ DAADRVAAEI TARGGQAIGV 
AASVTDEAAV GAMVRQVIDR WRRIDILVSN AGILRDKSFA KMSLDDFRLV VDVHLMGAVV
CAKAVWDVMR EQRYGRIVMT TSSSGLYGNF GQANYGAAKM ALVGLMQTLA IEGEKYGVRV
NCLAPTAATG MTEGVLSQAS LERLDPTLVS PGLLALVVEG APTRAILCAG AGHFATANIT
LTEGRYVGDA PDAGEQVIRQ WEAVSERAGE IVPAYGFAQA ERELASAGLI AAVAAERA
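
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is one way to do that in plain Python, under these assumptions: the space-separated blocks of ten characters are display formatting only, and the codon-to-amino-acid assignments of translation table 11 are the same as those of the standard code (table 11 differs in which codons may initiate translation, which does not matter here because this gene starts with ATG). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table in T, C, A, G order; the third base varies fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Drop the spaces and line breaks used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence, as formatted in the record.
first_blocks = "ATGCTGCTGC TGGATGGACA TGTCGCGATC"
print(translate(first_blocks))  # MLLLDGHVAI
```

The printed peptide matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would check the whole protein and the GC content field in the same way.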