Gene Caul_4830 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4830 
Symbol 
ID5902292 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp5227265 
End bp5228263 
Gene Length999 bp 
Protein Length332 aa 
Translation table11 
GC content68% 
IMG OID641565350 
Productalcohol dehydrogenase 
Protein accessionYP_001686448 
Protein GI167648785 
COG category[C] Energy production and conversion
[R] General function prediction only 
COG ID[COG0604] NADPH:quinone reductase and related Zn-dependent oxidoreductases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones45 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGGCGG TGCTCAGCAA GGCGGTCGGC GGACCCGAGA CCCTGGTGCT GGAGGAGCTT 
CCCGACCCCG TCGCCGGTCC GGGCCAGGTG CTGCTGGAGA TCAAGGCCTG CGGCGTCAAC
TATCCCGACG TGCTGATCAT CGAGGACAAG TACCAGTTCA AGCCCGAGCG GCCGTTCGCG
CCAGGCGGCG AGGTTTCGGG CGTGGTGCTG GCCCTGGGCG AGGGCGTGAC GACCCTGAAG
GTCGGCCAGC GCGTGCTGGC CTCGACCGGT CACGGCGGCA TGGCCGAGAA GGTGGCGCTG
GACGCCATGC GCTGCACGCC CATTCCCGAC AACATGCCCT TTGATGAAGC CGCCGCCTTC
ATCCTCACCT ACGGCACGTC TTACTATGCC CTGAAGGACC GCGGCCATCT GAAGGCCGGC
GAGACCCTGC TGGTGCTGGG CGCGGCCGGC GGCGTCGGCC TGGCGGCCGT CGAGCTGGGC
AAGGCGGCCG GGGCGCGGGT CATCGCCGCC TGCTCTAGCC AGGAGAAGGT GGACCTGGCG
ATCAAGCACG GCGCCGACGC CGGCGTGGTC TATCCGCGCG GTCCGTTCGA CAAGGACGGT
CAGAAGGCCC TGGCGACCCT GATCAAGGAG GCCTGCGGGC CGAACGGCTG GGACGTGGCC
TATGACGCGG TCGGCGGCGA CTATTCCGAA GCCACGATCC GCGCCGCCGG CTGGAACGGC
CGCTTCCTGG TCATTGGCTT CCCGTCGGGC ATTCCGAAGA TCCCGCTGAA CCTGACCCTG
CTGAAGTCCT GCGACATCGT CGGGGTGTTC TGGGGCGCCT CGGTGGCCCG CGATCCCAAG
GGCCACGCCC AGAACGTGCG CGAGCTGATG GATCTGTACC AGGCCGGCAA GATCAAGCCC
TATGTCTCCG AACGCTTTCC CTTGGAGAAG GCCGGCGACG CCATCGCCCA CCTGGCCAGC
CGCAAGGCCA TGGGCAAGGT CGTGGTGGTC ACGGACTAG
 
Protein sequence
MKAVLSKAVG GPETLVLEEL PDPVAGPGQV LLEIKACGVN YPDVLIIEDK YQFKPERPFA 
PGGEVSGVVL ALGEGVTTLK VGQRVLASTG HGGMAEKVAL DAMRCTPIPD NMPFDEAAAF
ILTYGTSYYA LKDRGHLKAG ETLLVLGAAG GVGLAAVELG KAAGARVIAA CSSQEKVDLA
IKHGADAGVV YPRGPFDKDG QKALATLIKE ACGPNGWDVA YDAVGGDYSE ATIRAAGWNG
RFLVIGFPSG IPKIPLNLTL LKSCDIVGVF WGASVARDPK GHAQNVRELM DLYQAGKIKP
YVSERFPLEK AGDAIAHLAS RKAMGKVVVV TD