Gene Caul_0488 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_0488 
Symbol 
ID5897943 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp530393 
End bp531499 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content67% 
IMG OID641560971 
Productaldo/keto reductase 
Protein accessionYP_001682120 
Protein GI167644457 
COG category[C] Energy production and conversion 
COG ID[COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value0.807596 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCCTTG ACCTCTACCG TCCACTGGGC CGTTCCGGCT TGATCGTCAG CCCCCTGGCC 
CTTGGAACGA TGACCTTCGG CGTGGCCCGT TGGGGAATGG AGCGTCCAGA CGCTGAGGCC
GTGTTCGACG CCTATGTTGA GGCGGGCGGA AATCTGATCG ACACGGCGGA CGTCTACGCC
GCAGGCGCCG GGGAGACGAT GTTGGGCGAG ATGCTCGCCG AGCGCGGAAA CCGGCATGAA
CTGGTATTGG CCACGAAGTC AGGCTTCGCC ACCGGCCGCG GACCTCACGC GGGCGGCAAT
GGCGCCAAAC ATATCCATGC GGCGCTCGAG GGCTCGCTGC GCCGTCTCAA GACCGACTAT
ATCGACCTGT ACTGGATCCA CGTCTGGGAC TCGGTGACGC CTCCCGAGCA GTTGTTGGAG
ACGATGTCCG CGCTCGTCAG AGCCGGAAAG ATTCGCTATT GGGGCATGTC GAACACCCCC
GCCTGGTATG TCGCCCGCGT CGTGACTCTG GCGATGGCCC GGGGCCAGCC GGGCCCGATC
GCCCTGCAGT ATTTCTATTC TCTGGTCAGC CGCGAGATCG AGGCGGAGCA TGTTCCGCTC
GCGCTCGACA CGGGCCTTGG CGTGATGCCC TGGAGCCCGC TGGCCTATGG CCTGCTCACG
GGCAAGTACG ACAGGGCCAC GGTGGAGGCG TCGCCGTCTC GCGCCGGGGG CCTGCCCAAC
GAGGCGGCGA CTGATGGCGC CAAGGCCGAC GGCGGGGCGC GGCTGGACGG GGCCAACCCT
TTCGGCGACA CGCTGTTCAC CGAGCGCAAC TGGAAGATCG TCGACGTCCT GAAGGCCGTC
GCCTTGGAAG TCGGGGAGCA GCCCGCAAAG GTCGCGCTGG CCTGGGCCCT CTCGCGCCCA
GCGGTCGACA CCGTCCTGAT CGGCGTCAGT CGGGTCGAGC AGTTGCGGGA TAATATGGGC
GCCGTCACCC TGCGCCTAGC GGCGGAGCAT CTTGCTGCGC TGGACGAGGC TAGCCGGCCG
GCGCTCCCGA TGCTGTACGG CCTGTTCAGT GACGAGATGC GTCGGCAGGT CGTCTTCGGC
GGAGCCCCAG TGGCGACGAG ATCATAG
 
Protein sequence
MSLDLYRPLG RSGLIVSPLA LGTMTFGVAR WGMERPDAEA VFDAYVEAGG NLIDTADVYA 
AGAGETMLGE MLAERGNRHE LVLATKSGFA TGRGPHAGGN GAKHIHAALE GSLRRLKTDY
IDLYWIHVWD SVTPPEQLLE TMSALVRAGK IRYWGMSNTP AWYVARVVTL AMARGQPGPI
ALQYFYSLVS REIEAEHVPL ALDTGLGVMP WSPLAYGLLT GKYDRATVEA SPSRAGGLPN
EAATDGAKAD GGARLDGANP FGDTLFTERN WKIVDVLKAV ALEVGEQPAK VALAWALSRP
AVDTVLIGVS RVEQLRDNMG AVTLRLAAEH LAALDEASRP ALPMLYGLFS DEMRRQVVFG
GAPVATRS