Gene Caul_4994 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4994 
Symbol 
ID5902456 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp5396711 
End bp5397763 
Gene Length1053 bp 
Protein Length350 aa 
Translation table11 
GC content67% 
IMG OID641565515 
Productaldo/keto reductase 
Protein accessionYP_001686612 
Protein GI167648949 
COG category[C] Energy production and conversion 
COG ID[COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value0.570117 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACTACG TGGAACTCGG CCGGACCGGC ATCCAGGTCT CGCGCTGCTG CCTGGGCACC 
ATGACCTGGG GCTCGCAGAA CAGCGAGGCG GAAGCCCACG AACAGATGGA CTACGCCCTG
GGGCAGGGCG TGACCTTCTG GGACACCGCC GAGATGTATT CCAGCCCGCC CAATCCCGAG
ACTCAAGGCA ATACCGAGCG CCATATCGGC TCGTGGCTGG CCAAAACCGG CAAGCGCCAG
GAGATTGTCC TGGCCTCGAA GGTGGCCGGC CGGGGCAATG CGTTCGGCGG CCTGTCGTGG
ATGCGGCCCG ACGGCGGCTC CACCCGCCAG ACCAAGGCCC AGATCGACTA CGCGGTCGAG
CAATCGCTCA AGCGCCTCAA CACCGACTAT CTCGACCTCT ACCAGCTGCA CTGGCCCGAC
CGGCCGGTGC GGGTGTTCGG CGGCCAGACC TTCAAGGACT ACGAGCAGGA CTTCGAGGCG
TTCGGCGACA TTCTCGAGGC GCTGGACGCC CACGTGAAGA AGGGGTCGAT CCGTTCGGTC
GGCGTCTCCA ACGAATTTCC GTGGGGCGTG ATGCGCTTCC TGGCCGAGGC TGAGACGCGC
GGCCTGCCGC GCATCGCCTC GATCCAGAAC GCCTACCACC TGGCTAACCG CACCTTCGAA
TACGGCCTGG CCGAGATCGC CCTGCGCGAA CAGGTGGGCC TGCTGGCCTA TTCGCCCCTG
GCCCAGGGCG CCCTGACCGG CAAGTACCTG GACGGCAAGC TGCCCGACGG TTCGCGCAAG
GCGCTCTACA ACCGCATGCA GCGCTACGAG GGTCCCGGCG CCGAGGAGGC GATCCGCGGC
TATGTGGATC TGGCCGCCCA TTTCGGCGTC GATCCGGCCC AGCTGGCCCT GAAGTTCTGC
GACACGCGGG AATTCGTCAC CGCCACCATC ATTGGCGCCA CCTCGATGGA CCAGCTGAAG
ACCAACATCG CCGCCTTCGA CCTGACCTGG ACCGAGGAGA TGGAGAGGGC GGTCAACGCC
CTGCACGCCC TGCGGCCCAA TCCGTGTCCG TGA
 
Protein sequence
MDYVELGRTG IQVSRCCLGT MTWGSQNSEA EAHEQMDYAL GQGVTFWDTA EMYSSPPNPE 
TQGNTERHIG SWLAKTGKRQ EIVLASKVAG RGNAFGGLSW MRPDGGSTRQ TKAQIDYAVE
QSLKRLNTDY LDLYQLHWPD RPVRVFGGQT FKDYEQDFEA FGDILEALDA HVKKGSIRSV
GVSNEFPWGV MRFLAEAETR GLPRIASIQN AYHLANRTFE YGLAEIALRE QVGLLAYSPL
AQGALTGKYL DGKLPDGSRK ALYNRMQRYE GPGAEEAIRG YVDLAAHFGV DPAQLALKFC
DTREFVTATI IGATSMDQLK TNIAAFDLTW TEEMERAVNA LHALRPNPCP