Gene Caul_5020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_5020 
Symbol 
ID5902482 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp5423458 
End bp5424423 
Gene Length966 bp 
Protein Length321 aa 
Translation table11 
GC content68% 
IMG OID641565541 
Productalcohol dehydrogenase 
Protein accessionYP_001686638 
Protein GI167648975 
COG category[C] Energy production and conversion
[R] General function prediction only 
COG ID[COG0604] NADPH:quinone reductase and related Zn-dependent oxidoreductases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.00109136 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGGCCA TACAAGCCGC CCGCACCGGC GGCCCCGAGG TTCTCGAGGC CGTCGAGCGT 
TCCGTCCCCA CGCCCGGACC TGGCCAGATC CTGGTTCGTC ATCAAGCCGT CGGCTTGAAC
TTCATCGACA CCTATCAGCG CAGCGGCCTC TACCCGATGA AGACGCCGGT CGTGCTCGGC
CTCGAGGCGG CGGGCGTCGT CGAGGAGGTC GGCGAGGACG TCACCCGGTT CAAGTTAGGC
GATCGCGTCG CCTATAACGG CACGCTCGGC GCCTATGCCG AGGCGGCCGT CGTGCCGGCC
GACCGCGCCG TGAAGGTTCC TGACGCGGTC AGCCTCGAGA CCGCGGCGGC CGTCCTGCTG
AAGGGCATGA CCGCGGAGTT TCTGGTCCAG CGTTGCCACA GGGTCGAACC CGATCAAACC
GTGTTGATCC ATGCGGCGGC GGGCGGGGTT GGCTCGATCC TGGTGCAATG GGCCAAGGCG
TTGGGGGCGA CCGTGATCGC CACCGTCGGC TCGGAAGCCA AGGCCGCCCT CGCCCGTGAC
CATGGCGCCG ACCATGTGAT CCTCTATGGC GAGGAGGACG TCGCGGCTCG GGTGTCCGAG
ATCACCGGCG GGCAAGGCGT GGCGGTCGTC TATGACGGGG TCGGCAAGGA CACCTTCGAG
GCCAGCCTCA AGAGCCTGGC TCGACGCGGT ATGCTGGTCA CCTTTGGCAA CGCCTCAGGA
CCCGTGCCGC CGTTCGCGCC GCTCGAACTG GGGAGCAAGT CGCTGTTCCT CACCCGACCG
AAGCTATTCG ACTACATCGC CACGACCGAG GAGTTGGATG AAAGCGCGGC GGCCCTGTTC
GCCGTGCTGG AGTCCGGCGC CGTGAAGATC GAGGTTGGAC AGACCTTCCC GCTCTCCGAG
GCTCGGGCCG CGCACGAAGC CCTGGAGGGT CGGCGAACGA CAGGGGCGAC GCTGCTTATT
CCGTAG
 
Protein sequence
MLAIQAARTG GPEVLEAVER SVPTPGPGQI LVRHQAVGLN FIDTYQRSGL YPMKTPVVLG 
LEAAGVVEEV GEDVTRFKLG DRVAYNGTLG AYAEAAVVPA DRAVKVPDAV SLETAAAVLL
KGMTAEFLVQ RCHRVEPDQT VLIHAAAGGV GSILVQWAKA LGATVIATVG SEAKAALARD
HGADHVILYG EEDVAARVSE ITGGQGVAVV YDGVGKDTFE ASLKSLARRG MLVTFGNASG
PVPPFAPLEL GSKSLFLTRP KLFDYIATTE ELDESAAALF AVLESGAVKI EVGQTFPLSE
ARAAHEALEG RRTTGATLLI P