Gene Caul_1906 details

Gene Information

Locus tag: Caul_1906
Symbol:
ID: 5899361
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 2044252
End bp: 2045274
Gene Length: 1023 bp
Protein Length: 340 aa
Translation table: 11
GC content: 70%
IMG OID: 641562396
Product: alcohol dehydrogenase
Protein accession: YP_001683533
Protein GI: 167645870
COG category: [C] Energy production and conversion; [R] General function prediction only
COG ID: [COG0604] NADPH:quinone reductase and related Zn-dependent oxidoreductases
TIGRFAM ID:
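
The coordinates and lengths listed above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive and that the gene ends in a single stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2044252
end_bp = 2045274

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# A coding sequence should be a whole number of codons.
assert gene_length % 3 == 0
codons = gene_length // 3

# Assumption: the final codon is a stop codon and is not translated.
protein_length = codons - 1

print(gene_length)     # 1023
print(protein_length)  # 340
```

Both results match the Gene Length (1023 bp) and Protein Length (340 aa) fields.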


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value: 0.466553
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCGAGCCC TTCTCAGTCT GCAGGCCGGC GGCCCCGAAA CCTTAAGGCT TTGCGACATC 
GACGAGCCCT CGCCGGGGCC GGGTCAGGTC GCCGTGGCCG TGCAGGCCTG CGGGGTCAAT
TTCCCGGATC TGCTGGTCAT CGAGGACAAA TACCAGCTGC GGCCGCCCCG GCCGTTTTCG
CCCGGCAGCG AGCTGGCGGG CGTGGTGCGC GCGATCGGCG AGGGCGTCGT GTCGCCGCGC
GTGGGCGAGC GCGTTTCCGC GTCATTGCCG TTTGGCTCGA TGGCGGAGGT CGTCCTTGTG
CCGGCGGACC GCTGCACGGT CGTGCCCGAC GCCATGCCGT TTGAGGAAGC CGCGGCGTTC
CAGGTGACCT ATGGCACCGT CTACCATGCC TTGGTGGCGC GCGCGGGGAT GCGGCTAGGC
GAGACGCTGC TGGTCCTCGG CGCCGGGGGC GGCGTCGGCC TGGCGGCCGT GGAGTTGGGC
AAGACCCTCG GCGCTCGCGT GGTGGCGGCC GCGTCTTCAC AAGACAAGCT GGACGCGGCG
CGGGCGATGG GCGCGGATGC GGTGGTGTTG TATCCGCGCG CGTTCGAGGA TGTCGCTCAG
GCCCGGGCCT TCACCGAGGC GGTGCGGGCC GCTTGTGGCG GGGAAGGGGC CGATGTCGTG
GTCGACCCGG TCGGCGGCGC CTATGCCGAG CCGGCGTTCC GGTCCATTGC GGGGCAGGGG
CGGTACCTGG TTGTGGGTTT TGCGGCCGGC ATTCCCGCCA TTCCGCTCAA TCTCGTCCTG
CTCAAAGCCG CGCAGATCAT CGGCGTCTTC TGGGGCGCCT ACATGGCGCG TGATCCGCAG
GCGGGCGCTC GGGACATGAA GGCGTTGTTG GATCTCTATA CCGCCGGCAA GATCAGGCCG
CGCGTCTCCG AGCGCTATGG CTTGGAGCAG GGCGGGGAGG CGATCGCGCG GTTGGGGTCG
CGCGGCGTCT CGGGCAAGAT CGTCGTGGAC GTGAATGGTT CTGAAAGCAG AACGAGTACC
TAA
 
Protein sequence
MRALLSLQAG GPETLRLCDI DEPSPGPGQV AVAVQACGVN FPDLLVIEDK YQLRPPRPFS 
PGSELAGVVR AIGEGVVSPR VGERVSASLP FGSMAEVVLV PADRCTVVPD AMPFEEAAAF
QVTYGTVYHA LVARAGMRLG ETLLVLGAGG GVGLAAVELG KTLGARVVAA ASSQDKLDAA
RAMGADAVVL YPRAFEDVAQ ARAFTEAVRA ACGGEGADVV VDPVGGAYAE PAFRSIAGQG
RYLVVGFAAG IPAIPLNLVL LKAAQIIGVF WGAYMARDPQ AGARDMKALL DLYTAGKIRP
RVSERYGLEQ GGEAIARLGS RGVSGKIVVD VNGSESRTST
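
The relationship between the gene sequence and the protein sequence above can be illustrated with a small translation sketch. It assumes the amino acid assignments of the standard genetic code, which translation table 11 shares, and that an initiating GTG codon is read as methionine. The `translate` function, the `START_CODONS` set, and the `first_codon_is_start` parameter are hypothetical names introduced for this example, and the start codon set is deliberately limited to the most common bacterial initiators rather than the full table 11 list.

```python
# Build the codon table from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES),
        AMINO_ACIDS,
    )
}

# Assumption: a subset of the initiator codons allowed in table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna, first_codon_is_start=True):
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (as used in the 10-base blocks above) is ignored.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        # An initiator codon at position 0 is read as methionine.
        if i == 0 and first_codon_is_start and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence: the GTG start codon becomes M.
print(translate("GTGCGAGCCC TTCTCAGTCT GCAGGCCGGC"))  # MRALLSLQAG

# Last 33 bases of the gene sequence, ending in the TAA stop codon.
print(translate("GTGAATGGTT CTGAAAGCAG AACGAGTACC TAA",
                first_codon_is_start=False))  # VNGSESRTST
```

The two fragments reproduce the first and last ten residues of the protein sequence listed above. In the second call the leading GTG is internal to the gene, so it is translated as valine rather than methionine.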