Gene Caul_3781 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3781 
Symbol 
ID5901243 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4098346 
End bp4099539 
Gene Length1194 bp 
Protein Length397 aa 
Translation table11 
GC content71% 
IMG OID641564304 
Productiron-containing alcohol dehydrogenase 
Protein accessionYP_001685406 
Protein GI167647743 
COG category[C] Energy production and conversion 
COG ID[COG1454] Alcohol dehydrogenase, class IV 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones41 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0115621 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCCATC TGGACATGAG GGAAAGCGGC GTGATGGGGG CGGGAGAGCA AAGGCTGATC 
AGCATGGTGA ACGTGGGAAC GCTCACGTTC GGACCGGGAG CGCTTGCGCG CTGCGCTGTC
GACCTGCTCG CCCGGCGGGC GCGATCGGTC TTTATCGTGA CCACCGAGCC GACGCTCTTC
CTGTGCGCAC CCCTGGTCGA TGCGCTGGAG GCGGCGGGCG TGACCGTCAC CCTCTGGCAC
GACCTAGTCG GCGAACCGAC GTTGACGGAA TTCGCCGCCG CGCTGGAGGC GACGCGCGCC
TGCCAGGCCG ACGCTGTGGT GGGCCTGGGC GGCGGCAGCG CGATGGACGT CGCCAAGCTG
GTCGCCGCGC TCGCCGACGG CAAGCAACGC ATCACCGAGG TGCTGGGCAC GGACCTGCTG
GCGGGCCGCG CCCTGTGGCT GGCGTGCATC CCGACCACGG CGGGCACCGG CAGCGAGGTG
ACGCCGATCG CCATTCTCGG CGACGAGGAC GAGGATCTGA AGAAGGGCGT CGTCAGCCCC
CATCTGGTGC CCGACGCGGC CTATCTGGAT CCGGCCCTGA CCGTAACCAT GCCACCCTCG
GTGACGGCGG CGACCGGCCT GGACGCCCTG ACCCACTGCA TCGAGGCCTA CGCCAATCGC
TTCGCCCATC CGCTGGTCGA CGTCTATGCC CTGGGCGGCA TCCGGCTGAT CGCCGAGAAC
CTCGAGCGCG CCTGCACGCA CGGCGACGAT CTGGCCGCGC GCTCGGCGAT GATGATCGCC
AGCTATTACG GCGGCCTGTG CCTGGGGCCG GTCAACACCG CGGCCGTCCA CGCCCTGGCC
TATCCGCTGG GCGGCGAGTT CCACATCGCC CACGGCGTGG CCAACGCCCT GCTGCTGCCG
CACGTGCTGC GCTTCAACAT CGAGACGACG CCGGAACGCT ACGCCGCCAT CGCGCGGGCG
CTGGGCGCCG ACTGCACCGG CGACCATGCG GCCGACGCCC TGGCCGGCGT GGAACGCGTG
ATCGCCCTGG CCGACGCCTG CGGGATCAAG CGGCGCCTGT CGGACTTTGG CATCGAACGC
CACGCGATTC CGCGGATGGC GACGGCGGCG ATGAAGGTCA CCCGCCTGCT CGACCGCAAT
CCCCGCACGC TGACCGAGGC CGACGCCCGC GCGATCTACG AAGCGGCCTG GTAG
 
Protein sequence
MAHLDMRESG VMGAGEQRLI SMVNVGTLTF GPGALARCAV DLLARRARSV FIVTTEPTLF 
LCAPLVDALE AAGVTVTLWH DLVGEPTLTE FAAALEATRA CQADAVVGLG GGSAMDVAKL
VAALADGKQR ITEVLGTDLL AGRALWLACI PTTAGTGSEV TPIAILGDED EDLKKGVVSP
HLVPDAAYLD PALTVTMPPS VTAATGLDAL THCIEAYANR FAHPLVDVYA LGGIRLIAEN
LERACTHGDD LAARSAMMIA SYYGGLCLGP VNTAAVHALA YPLGGEFHIA HGVANALLLP
HVLRFNIETT PERYAAIARA LGADCTGDHA ADALAGVERV IALADACGIK RRLSDFGIER
HAIPRMATAA MKVTRLLDRN PRTLTEADAR AIYEAAW