Gene Caul_3241 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3241 
Symbol 
ID5900696 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp3501445 
End bp3502356 
Gene Length912 bp 
Protein Length303 aa 
Translation table11 
GC content68% 
IMG OID641563746 
Productshort-chain dehydrogenase/reductase SDR 
Protein accessionYP_001684866 
Protein GI167647203 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism
[R] General function prediction only 
COG ID[COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value0.707326 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.647553 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCATGC TTGAAAACAA AGTCGTCGCC GTCACGGGGG CGGGCCGGGG CATCGGCAGG 
GCCGTCGCCC TGCTGTGCGC GGCCCAGGGC GCCAAGGTGA TCGTCAACGA TCTGGGCGGC
GGGGCGGATG GGCAGGGGCG CGACGCCGAC CCTGCCAGTC AGGTGGTCAA GGAAATCCTC
GCGGCCGGCG GTCAGGCCTA CGCCAACACC GCCAGCGTTT CAGACGCGCA GGGCGCGGCT
TCGATCATCG AGGATGCGGT CTCCCAATTT GGGCGCATCG ACGCGGTGGT CAACAACGCC
GGCTTCCTGC GCGACAGCAT CTTCCACAAG ATGGATCAGG CTGACTGGAA CGACGTCATC
GCCGTGCACC TGACCGGCTG CTTCCAGGTC TCGCGCGCGG CCGCGCCCCA TTTCAAGGCC
CAGGGCTCGG GCGCGTTCGT GCAGTTCACC TCGACGACCG GCCTGCTGGG AAACCTTGGT
CAAGCCAACT ACGCGGCCGC CAAGGCCGGC GTGGTGGGCC TGTCGACGGC CATCGCCCTG
GACATGCGGC GCTTTGGCGT TCGCTCCAAC TGTGTCGCCC CGACCGCGTG GACGCGCCTG
CTCGACACCG TCCCCGTCGA CAGCGCGGAA AAGCGCGCCG CGATGGCGCG GCTCAAGACC
CTCACGCCCG AGAAGATCGC GCCCCTGGTG GCCTTCCTTT GTTCGGACCA GGCCGCCGAT
GTCAGCGGCC AGATCTTCGG CGTACGGGGA AACGAGGTCT TCCTCTATTC GCGGCCGACC
ATCCTGCGCA CCATGCAGAT GACCGAGGGC TGGACGCCGC AGACCTGCGC CGAGGTGCTG
ATGCCCGCGC TGCGGCCCAG CTTCCAGCCC CTGTTGACCA CGCCCGAGAT CATTTCGTGG
GATCCGCAAT GA
 
Protein sequence
MTMLENKVVA VTGAGRGIGR AVALLCAAQG AKVIVNDLGG GADGQGRDAD PASQVVKEIL 
AAGGQAYANT ASVSDAQGAA SIIEDAVSQF GRIDAVVNNA GFLRDSIFHK MDQADWNDVI
AVHLTGCFQV SRAAAPHFKA QGSGAFVQFT STTGLLGNLG QANYAAAKAG VVGLSTAIAL
DMRRFGVRSN CVAPTAWTRL LDTVPVDSAE KRAAMARLKT LTPEKIAPLV AFLCSDQAAD
VSGQIFGVRG NEVFLYSRPT ILRTMQMTEG WTPQTCAEVL MPALRPSFQP LLTTPEIISW
DPQ